(57) Abstract: The invention provides isolated nucleic acid molecules, designated TANGO 253, which encode proteins
containing C1q domains and which are homologous to a human adipocyte complement-mediated protein precursor, TANGO 257,
which encode proteins homologous to the human extracellular molecule olfactomedin, a molecule important in the
maintenance, growth and differentiation of chemosensory cilia of olfactory neurons, INTERCEPT 258, which encode Ig
domain-containing proteins that exhibit homology to an antigen (A33) expressed in colonic and small bowel epithelium,
and TANGO 281, which encode proteins downregulated in megakaryocytes that fail to express the gata-1 transcription
factor (a factor critical for blood cell formation) and can, therefore, represent direct or indirect gata-1 targets.
The invention also provides antisense nucleic acid molecules, expression vectors containing the nucleic acid molecules
of the invention, host cells into which the expression vectors have been introduced, and non-human transgenic animals
in which a nucleic acid molecule of the invention has been introduced or disrupted. The invention still further
provides isolated polypeptides, fusion polypeptides, antigenic peptides and antibodies. Diagnostic, screening and
therapeutic methods utilizing compositions of the invention are also provided.
SECRETED PROTEINS AND USES
THEREOF 

This application is a continuation-in-part of U.S. patent application Serial No.
09/336,536, filed June 18, 2000, the contents of which are incorporated herein by
reference in their entirety.

Background of the Invention 

Many secreted proteins, for example, cytokines and cytokine receptors, play a vital 
role in the regulation of cell growth, cell differentiation, and a variety of specific cellular 
responses. A number of medically useful proteins, including erythropoietin, granulocyte- 
macrophage colony stimulating factor, human growth hormone, and various interleukins, 
are secreted proteins. Thus, an important goal in the design and development of new 
therapies is the identification and characterization of secreted and transmembrane proteins 
and the genes which encode them. 

Many secreted proteins are receptors which bind a ligand and transduce an 
intracellular signal, leading to a variety of cellular responses. The identification and 
characterization of such a receptor enables one to identify both the ligands which bind to 
the receptor and the intracellular molecules and signal transduction pathways associated 
with the receptor, permitting one to identify or design modulators of receptor activity, e.g., 
receptor agonists or antagonists and modulators of signal transduction. 

Summary of the Invention 

The present invention is based, at least in part, on the discovery of cDNA 
molecules which encode the TANGO 253, 257 and 281 proteins and the INTERCEPT 258 
protein, all of which are either wholly secreted or transmembrane proteins. 

The TANGO 253 proteins are C1q domain-containing polypeptides that exhibit
homology to a human adipocyte complement-related protein precursor. 

The TANGO 257 proteins are homologous to the human extracellular molecule 
olfactomedin, a molecule important in the maintenance, growth and differentiation of 
chemosensory cilia of olfactory neurons. 

The INTERCEPT 258 proteins are Ig domain-containing polypeptides that exhibit
homology to an antigen (A33) expressed in colonic and small bowel epithelium, a protein
that may represent a cancer cell marker.

The TANGO 281 proteins represent proteins downregulated in megakaryocytes
that fail to express the gata-1 transcription factor (a factor critical for blood cell formation)
and can, therefore, represent direct or indirect gata-1 targets.

The TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 proteins,
fragments, derivatives, and variants thereof are collectively referred to herein as
"polypeptides of the invention" or "proteins of the invention." Nucleic acid molecules
encoding the polypeptides or proteins of the invention are collectively referred to as
"nucleic acids of the invention."

The nucleic acids and polypeptides of the present invention are useful as
modulating agents in regulating a variety of cellular processes. Accordingly, in one
aspect, this invention provides isolated nucleic acid molecules encoding a polypeptide of
the invention or a biologically active portion thereof. The present invention also provides
nucleic acid molecules which are suitable for use as primers or hybridization probes for
the detection of nucleic acids encoding a polypeptide of the invention.

The invention features nucleic acid molecules which are at least 30%, 35%, 40%,
45%, 50%, 55%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of
SEQ ID NO:1, SEQ ID NO:2, or the nucleotide sequence of the cDNA insert of an
EpT253 clone deposited with ATCC® as Accession Number 207222, or a complement
thereof.

The invention features nucleic acid molecules which are at least 30%, 35%, 40%,
45%, 50%, 55%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of
SEQ ID NO:8, SEQ ID NO:9, or the nucleotide sequence of the cDNA insert of an
EpTm253 clone deposited with ATCC® as Accession Number 207215, or a complement
thereof.

The invention features nucleic acid molecules which are at least 95% or 98%
identical to the nucleotide sequence of SEQ ID NO:15, SEQ ID NO:16, or the nucleotide
sequence of the cDNA insert of an EpT257 clone deposited with ATCC® as Accession
Number 207222, or a complement thereof.

The invention features nucleic acid molecules which are at least 95% or 98%
identical to the nucleotide sequence of SEQ ID NO:21, SEQ ID NO:22, or the nucleotide
sequence of the cDNA insert of an EpTm257 clone deposited with ATCC® as Accession
Number 207217, or a complement thereof.

The invention features nucleic acid molecules which are at least 45%, 50%, 55%,
65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of SEQ ID NO:26,
SEQ ID NO:27, or the nucleotide sequence of the cDNA insert of an EpT258 clone
deposited with ATCC® as Accession Number 207222, or a complement thereof.

The invention features nucleic acid molecules which are at least 45%, 50%, 55%,
65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of SEQ ID NO:37,
SEQ ID NO:38, or the nucleotide sequence of the cDNA insert of an EpTm258 clone
deposited with ATCC® as Accession Number 207221, or a complement thereof.

The invention features nucleic acid molecules which are at least 30%, 35%, 40%,
45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence
of SEQ ID NO:46, SEQ ID NO:47, or the nucleotide sequence of the cDNA insert of an
EpT281 clone deposited with ATCC® as Accession Number 207222, or a complement
thereof.

The invention features nucleic acid molecules which are at least 35%, 40%, 45%,
50%, 55%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of SEQ ID
NO:56, SEQ ID NO:57, or the nucleotide sequence of the cDNA insert of an EpTm281
clone deposited with ATCC® as patent deposit Number PTA-224, or a complement
thereof.

The invention features nucleic acid molecules which are at least 30%, 35%, 40%,
45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 98% identical to the
nucleotide sequence of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56,
57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127,
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163,
165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182,
183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, a complement thereof, or the
non-coding strand of EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or
EpTm281 cDNA of ATCC® Accession Number 207222, Accession Number 207215, Accession
Number 207217, Accession Number 207221, or patent deposit Number PTA-224, wherein said
nucleic acid molecules encode polypeptides or proteins that exhibit at least one structural
and/or functional feature of a polypeptide of the invention.

The invention features nucleic acid molecules of at least 450, 500, 550, 600, 650,
700, 750, 800, 850, 900, 1000, 1100, 1200 or 1300 contiguous nucleotides of the
nucleotide sequence of SEQ ID NO:1, the nucleotide sequence of an EpT253 cDNA of
ATCC® Accession Number 207222, or a complement thereof.

The invention features nucleic acid molecules which include a fragment of at least 
50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700 or 720 contiguous 
nucleotides of the nucleotide sequence of SEQ ID NO:2, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least
540, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200 or 1250 contiguous
nucleotides of the nucleotide sequence of SEQ ID NO:8, the nucleotide sequence of an
EpTm253 cDNA of ATCC® Accession Number 207215, or a complement thereof.

The invention features nucleic acid molecules of at least 310, 350, 400, 450, 500,
550, 600, 650 or 700 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:9,
or a complement thereof.

The invention features nucleic acid molecules which include a fragment of at least
1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:15 or its
complement.

The invention features nucleic acid molecules which include a fragment of at least
1150 or 1200 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:16, or its
complement.

The invention features nucleic acid molecules which include a fragment of at least
1100, 1200, 1300, 1400, 1500, 1600 or 1700 contiguous nucleotides of the nucleotide
sequence of SEQ ID NO:21, the nucleotide sequence of an EpTm257 cDNA of ATCC®
Accession Number 207217, or a complement thereof.

The invention features nucleic acid molecules which include a fragment of at least
1150 or 1200 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:22, or its
complement.

The invention features nucleic acid molecules which include a fragment of at least
420, 450, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, or
1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:26, the nucleotide
sequence of an EpT258 cDNA of ATCC® Accession Number 207222, or a complement
thereof.

The invention features nucleic acid molecules which include a fragment of at least
50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500,
1600, 1700 or 1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:27,
or a complement thereof.

The invention features nucleic acid molecules which include a fragment of at least
675, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous
nucleotides of the nucleotide sequence of SEQ ID NO:37, the nucleotide sequence of an
EpTm258 cDNA of ATCC® Accession Number 207221, or a complement thereof.

The invention features nucleic acid molecules which include a fragment of at least
500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800
contiguous nucleotides of the nucleotide sequence of SEQ ID NO:38, or a complement
thereof.

The invention features nucleic acid molecules which include a fragment of at least
50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500,
1600, 1700 or 1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:46,
the nucleotide sequence of an EpT281 cDNA of ATCC® Accession Number 207222, or a
complement thereof.

The invention features nucleic acid molecules which include a fragment of at least 
50, 100, 200, 300, 400, 500, 600, 700 or 750 contiguous nucleotides of the nucleotide 
sequence of SEQ ID NO:47, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least
550, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800 or 1850
contiguous nucleotides of the nucleotide sequence of SEQ ID NO:56, the nucleotide
sequence of an EpTm281 cDNA of ATCC® patent deposit Number PTA-224, or a
complement thereof.

The invention features nucleic acid molecules which include a fragment of at least
50, 100, 200, 300, 400, 500, 600 or 700 contiguous nucleotides of the nucleotide sequence
of SEQ ID NO:57, or a complement thereof.

The invention features isolated nucleic acid molecules having a nucleotide
sequence that is at least about 20, 50, 100, 150, 200, 250, 300, 400, 450, 500, 550, 600,
650, 700 or more contiguous nucleotides identical to the nucleic acid sequence of SEQ ID
NOS:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103,
105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139,
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170,
171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188,
189, 190, 191 or 192, or a complement thereof, or the non-coding strand of EpT253,
EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or EpTm281 cDNA of
ATCC® Accession Number 207222, Accession Number 207215, Accession Number 207217,
Accession Number 207221, or patent deposit Number PTA-224, wherein said nucleic acid
molecules encode polypeptides or proteins that exhibit at least one structural and/or
functional feature of a polypeptide of the invention.

The invention also features nucleic acid molecules which include a nucleotide
sequence encoding a protein having an amino acid sequence that is at least 40%, 45%,
50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of
SEQ ID NO:3, the amino acid sequence encoded by an EpT253 cDNA of ATCC®
Accession Number 207222, or a complement thereof.

The invention also features nucleic acid molecules which include a nucleotide
sequence encoding a protein having an amino acid sequence that is at least 95% or 98%
identical to the amino acid sequence of SEQ ID NO:10, the amino acid sequence encoded
by an EpTm253 cDNA of ATCC® Accession Number 207215, or a complement thereof.

The invention also features nucleic acid molecules which include a nucleotide
sequence encoding a protein having an amino acid sequence that is at least 88%, 90%,
95% or 98% identical to the amino acid sequence of SEQ ID NO:17, the amino acid
sequence encoded by an EpT257 cDNA of ATCC® Accession Number 207222, or a
complement thereof.

The invention also features nucleic acid molecules which include a nucleotide
sequence encoding a protein having an amino acid sequence that is at least 88%, 90%,
95%, or 98% identical to the amino acid sequence of SEQ ID NO:23, the amino acid
sequence encoded by an EpTm257 cDNA of ATCC® Accession Number 207217, or a
complement thereof.

The invention also features nucleic acid molecules which include a nucleotide
sequence encoding a protein having an amino acid sequence that is at least 45%, 50%,
55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID
NO:28, the amino acid sequence encoded by an EpT258 cDNA of ATCC® Accession
Number 207222, or a complement thereof.

The invention also features nucleic acid molecules which include a nucleotide
sequence encoding a protein having an amino acid sequence that is at least 45%, 50%,
55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID
NO:39, the amino acid sequence encoded by an EpTm258 cDNA of ATCC® Accession
Number 207221, or a complement thereof.

The invention also features nucleic acid molecules which include a nucleotide
sequence encoding a protein having an amino acid sequence that is at least 30%, 35%,
40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid
sequence of SEQ ID NO:48, the amino acid sequence encoded by an EpT281 cDNA of
ATCC® Accession Number 207222, or a complement thereof.

The invention also features nucleic acid molecules which include a nucleotide
sequence encoding a protein having an amino acid sequence that is at least 30%, 35%,
40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid
sequence of SEQ ID NO:58, the amino acid sequence encoded by an EpTm281 of
ATCC® patent deposit Number PTA-224, or a complement thereof.

The invention also features nucleic acid molecules which include a nucleotide
sequence encoding a polypeptide or protein having an amino acid sequence that is at least
30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the
amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, or 58, the amino acid
sequence encoded by EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258,
EpT281, or EpTm281 of ATCC® Accession Number 207222, Accession Number 207215,
Accession Number 207217, or Accession Number 207221, patent deposit Number
PTA-224, or a complement thereof, wherein the polypeptide or protein encoded by the
nucleotide sequence also exhibits at least one structural and/or functional feature of a
polypeptide of the invention.

In preferred embodiments, the nucleic acid molecules have the nucleotide sequence
of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56 or 57, or the nucleotide
sequence of the cDNA clones of ATCC® Accession Number 207222, 207215, 207217,
207221, 207222, or PTA-224.

Also within the invention are nucleic acid molecules which encode a fragment of a
polypeptide having the amino acid sequence of SEQ ID NO:3, or a fragment including at
least 10, 15, 20, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 230 or 240 contiguous amino
acids of SEQ ID NO:3, or the amino acid sequence encoded by an EpT253 cDNA of
ATCC® Accession Number 207222.

Also within the invention are nucleic acid molecules which encode a fragment of a
polypeptide having the amino acid sequence of SEQ ID NO:17, or a fragment including at
least 10, 15, 20, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 230 or 240 contiguous amino
acids of SEQ ID NO:10, or the amino acid sequence encoded by an EpTm253 cDNA of
ATCC® Accession Number 207215.

Also within the invention are nucleic acid molecules which encode a fragment of a
polypeptide having the amino acid sequence of SEQ ID NO:10, or a fragment including at
least 360, 370, 380, 390 or 400 contiguous amino acids of SEQ ID NO:17, or the amino
acid sequence encoded by an EpT257 cDNA of ATCC® Accession Number 207222.

Also within the invention are nucleic acid molecules which encode a fragment of a
polypeptide having the amino acid sequence of SEQ ID NO:23, or a fragment including at
least 360, 370, 380, 390 or 400 contiguous amino acids of SEQ ID NO:23, or the amino
acid sequence encoded by an EpTm257 cDNA of ATCC® Accession Number 207217.

Also within the invention are nucleic acid molecules which encode a fragment of a
polypeptide having the amino acid sequence of SEQ ID NO:3, or a fragment including at
least 15, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 350 or 360
contiguous amino acids of SEQ ID NO:28, or the amino acid sequence encoded by an
EpT258 cDNA of ATCC® Accession Number 207222.

Also within the invention are nucleic acid molecules which encode a fragment of a
polypeptide having the amino acid sequence of SEQ ID NO:39, or a fragment including at
least 160, 175, 200, 225, 250, 275, 300, 350, 375 or 385 contiguous amino acids of SEQ
ID NO:39, or the amino acid sequence encoded by an EpT258 cDNA of ATCC®
Accession Number 207221.

Also within the invention are nucleic acid molecules which encode a fragment of a
polypeptide having the amino acid sequence of SEQ ID NO:48, or a fragment including at
least 15, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 235 or 240 contiguous amino acids
of SEQ ID NO:48, or the amino acid sequence encoded by an EpT281 cDNA of ATCC®
Accession Number 207222.

Also within the invention are nucleic acid molecules which encode a fragment of a
polypeptide having the amino acid sequence of SEQ ID NO:58, or a fragment including at
least 15, 25, 30, 50, 75, 100, 125, 150, 175 or 200 contiguous amino acids of SEQ ID
NO:58, or the amino acid sequence encoded by an EpTm281 cDNA of ATCC® patent
deposit Number PTA-224.

The invention also features nucleic acid molecules which encode a polypeptide
fragment of at least 15, 25, 30, 50, 75, 100, 125, 150, 175, 200 or more contiguous amino
acids of SEQ ID NO:3, 10, 17, 23, 28, 39, 48 or 58, or the amino acid sequence encoded
by EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or EpTm281 of
ATCC® Accession Number 207222, Accession Number 207215, Accession Number
207217, Accession Number 207221 or patent deposit Number PTA-224, wherein the
fragment also exhibits at least one structural and/or functional feature of a polypeptide of
the invention.

The invention includes nucleic acid molecules which encode a naturally occurring
allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID NO:3, 10,
17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126,
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162
or 164, or the amino acid sequence encoded by a cDNA of ATCC® Accession Number
207222, Accession Number 207215, Accession Number 207217, Accession Number
207221 or patent deposit Number PTA-224, wherein the nucleic acid molecule hybridizes
to a nucleic acid molecule consisting of a nucleic acid sequence encoding SEQ ID NO:3,
10, 28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128,
130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or
164, or the amino acid sequence encoded by a cDNA of ATCC® Accession Number
207222, Accession Number 207215, Accession Number 207217, Accession Number
207221 or patent deposit Number PTA-224, or a complement thereof under stringent
conditions.

Also within the invention are isolated polypeptides or proteins having an amino
acid sequence that is at least about 40%, preferably 45%, 55%, 65%, 75%, 85%, 95% or
98% identical to the amino acid sequence of SEQ ID NO:3, or the amino acid sequence
encoded by an EpT253 cDNA of ATCC® Accession Number 207222.

Also within the invention are isolated polypeptides or proteins having an amino
acid sequence that is at least about 40%, preferably 45%, 50%, 55%, 65%, 75%, 85%,
95% or 98% identical to the amino acid sequence of SEQ ID NO:10, or the amino acid
sequence encoded by an EpTm253 cDNA of ATCC® Accession Number 207215.

Also within the invention are isolated polypeptides or proteins having an amino
acid sequence that is at least 88%, 90%, 95% or 98% identical to the amino acid sequence
of SEQ ID NO:17, or the amino acid sequence encoded by an EpT257 cDNA of ATCC®
Accession Number 207222.

Also within the invention are isolated polypeptides or proteins having an amino
acid sequence that is at least 88%, 90%, 95% or 98% identical to the amino acid sequence
of SEQ ID NO:23, or the amino acid sequence encoded by an EpTm257 cDNA of
ATCC® Accession Number 207217.

Also within the invention are isolated polypeptides or proteins having an amino
acid sequence that is at least about 30%, preferably 35%, 45%, 55%, 65%, 75%, 85%,
95% or 98% identical to the amino acid sequence of SEQ ID NO:28, or the amino acid
sequence encoded by an EpT258 cDNA of ATCC® Accession Number 207222.

Also within the invention are isolated polypeptides or proteins having an amino
acid sequence that is at least about 30%, preferably 35%, 40%, 45%, 50%, 55%, 65%,
75%, 85%, 95% or 98% identical to the amino acid sequence of SEQ ID NO:39, or the
amino acid sequence encoded by an EpTm258 cDNA of ATCC® Accession Number
207221.

Also within the invention are isolated polypeptides or proteins having an amino
acid sequence that is at least about 30%, preferably 35%, 45%, 55%, 65%, 75%, 85%,
95% or 98% identical to the amino acid sequence of SEQ ID NO:48, or the amino acid
sequence encoded by an EpT281 cDNA of ATCC® Accession Number 207222.

Also within the invention are isolated polypeptides or proteins having an amino
acid sequence that is at least about 30%, preferably 35%, 40%, 45%, 50%, 55%, 65%,
75%, 85%, 95% or 98% identical to the amino acid sequence of SEQ ID NO:58, or the
amino acid sequence encoded by an EpTm281 cDNA of ATCC® patent deposit Number
PTA-224.

The invention also features isolated polypeptides or proteins having an amino acid
sequence that is at least about 30%, preferably 35%, 40%, 45%, 50%, 55%, 65%, 75%,
85%, 95% or 98% identical to the amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28,
39, 48 or 58, or the amino acid sequence encoded by EpT253, EpTm253, EpT257,
EpTm257, EpT258, EpTm258, EpT281 or EpTm281 of ATCC® Accession Number
207222, Accession Number 207215, Accession Number 207217, Accession Number
207221, patent deposit Number PTA-224, wherein the protein or polypeptides also
exhibit at least one structural and/or functional feature of a polypeptide of the invention.

Also within the invention are isolated polypeptides or proteins which are encoded
by a nucleic acid molecule having a nucleotide sequence that is at least about 30%,
preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to
the nucleic acid sequence encoding SEQ ID NO:3, and isolated polypeptides or proteins
which are encoded by a nucleic acid molecule having a nucleotide sequence which
hybridizes under stringent hybridization conditions to a nucleic acid molecule having the
nucleotide sequence of SEQ ID NO:1 or SEQ ID NO:2, a complement thereof, or the
non-coding strand of an EpT253 cDNA of ATCC® Accession Number 207222.

Also within the invention are isolated polypeptides or proteins which are encoded
by a nucleic acid molecule having a nucleotide sequence that is at least about 30%,
preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to
the nucleic acid sequence encoding SEQ ID NO:10, and isolated polypeptides or proteins
which are encoded by a nucleic acid molecule having a nucleotide sequence which
hybridizes under stringent hybridization conditions to a nucleic acid molecule having the
nucleotide sequence of SEQ ID NO:8 or SEQ ID NO:9, a complement thereof, or the
non-coding strand of an EpTm253 cDNA of ATCC® Accession Number 207215.

Also within the invention are isolated polypeptides or proteins which are encoded
by a nucleic acid molecule having a nucleotide sequence that is at least about 45%, 50%,
55%, 60%, 65%, 75%, 85%, 95% or 98% identical to the nucleic acid sequence encoding
SEQ ID NO:28, and isolated polypeptides or proteins which are encoded by a nucleic acid
molecule having a nucleotide sequence which hybridizes under stringent hybridization
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:26 or
SEQ ID NO:27, a complement thereof, or the non-coding strand of an EpT258 cDNA of
ATCC® Accession Number 207222.

Also within the invention are isolated polypeptides or proteins which are encoded
by a nucleic acid molecule having a nucleotide sequence that is at least about 45%, 50%,
55%, 60%, 65%, 75%, 85%, 95% or 98% identical to the nucleic acid sequence encoding
SEQ ID NO:39, and isolated polypeptides or proteins which are encoded by a nucleic acid
molecule having a nucleotide sequence which hybridizes under stringent hybridization
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or
SEQ ID NO:38, a complement thereof, or the non-coding strand of an EpTm258 cDNA of
ATCC® Accession Number 207221.

Also within the invention are isolated polypeptides or proteins which are encoded
by a nucleic acid molecule having a nucleotide sequence that is at least about 30%,
preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to
the nucleic acid sequence encoding SEQ ID NO:48, and isolated polypeptides or proteins
which are encoded by a nucleic acid molecule having a nucleotide sequence which
hybridizes under stringent hybridization conditions to a nucleic acid molecule having the
nucleotide sequence of SEQ ID NO:46 or SEQ ID NO:47, a complement thereof, or the
non-coding strand of an EpT281 cDNA of ATCC® Accession Number 207222.

Also within the invention are isolated polypeptides or proteins which are encoded
by a nucleic acid molecule having a nucleotide sequence that is at least about 30%,
preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to
the nucleic acid sequence encoding SEQ ID NO:58, and isolated polypeptides or proteins
which are encoded by a nucleic acid molecule having a nucleotide sequence which
hybridizes under stringent hybridization conditions to a nucleic acid molecule having the
nucleotide sequence of SEQ ID NO:56 or SEQ ID NO:57, a complement thereof, or the
non-coding strand of an EpTm281 cDNA of ATCC® patent deposit Number PTA-224.

The invention also features isolated polypeptides or proteins which are encoded by
a nucleic acid molecule having a nucleotide sequence that is at least about 30%, preferably
35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to a nucleic
acid sequence encoding SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110,
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146,
148, 150, 152, 154, 156, 158, 160, 162 or 164, isolated polypeptides or proteins which are
encoded by a nucleic acid molecule having a nucleotide sequence which hybridizes under
stringent hybridization conditions to a nucleic acid molecule having the nucleotide
sequence of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 101,
103, 104, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135,
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168,
169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186,
187, 188, 189, 190, 191 or 192, a complement thereof, or the non-coding strand of
EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281, EpTm281 of
ATCC® Accession Number 207222, Accession Number 207215, Accession Number
207217, Accession Number 207221, patent deposit Number PTA-224, wherein
polypeptides or proteins also exhibit at least one structural and/or functional feature of a
polypeptide of the invention.

WO 00/78808    PCT/US00/16883

Also within the invention are polypeptides which are naturally occurring allelic
variants of a polypeptide that includes the amino acid sequence of SEQ ID NO:3, 10, 17,
23, 28, 39, 48 or 58, or the amino acid sequence encoded by a cDNA of ATCC®
Accession Number 207222, Accession Number 207215, Accession Number 207217,
Accession Number 207221, or patent deposit Number PTA-224, wherein the polypeptide
is encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule having
the sequence of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56 or 57, or a
complement thereof under stringent conditions.

The invention also features nucleic acid molecules that hybridize under stringent
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:1 or
2, or an EpT253 cDNA of ATCC® Accession Number 207222, or a complement thereof.
In other embodiments, the nucleic acid molecules are at least 450, 500, 550, 600, 650,
700, 750, 800, 1000, 1100, 1200 or 1300 contiguous nucleotides in length and hybridize
under stringent conditions to a nucleic acid molecule comprising the nucleotide sequence
of SEQ ID NO:1 or 2, an EpT253 cDNA of ATCC® Accession Number 207222, or a
complement thereof.

The invention also features nucleic acid molecules that hybridize under stringent
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:8 or
SEQ ID NO:9, an EpTm253 cDNA of ATCC® Accession Number 207215, or a
complement thereof. In other embodiments, the nucleic acid molecules are at least 540,
550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1100, 1159, 1200, or 1250
contiguous nucleotides in length and hybridize under stringent conditions to a nucleic acid
molecule comprising the nucleotide sequence of SEQ ID NO:8 or SEQ ID NO:9, an
EpTm253 cDNA of ATCC® Accession Number 207215, or a complement thereof.

The invention also features nucleic acid molecules that hybridize under stringent
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:15 or
SEQ ID NO:16, an EpT257 cDNA of ATCC® Accession Number 207222, or a
complement thereof, and encode a polypeptide comprising the amino acid sequence of
SEQ ID NO:17, or encode a polypeptide comprising at least 360, 370, 380, 390 or 400
contiguous amino acids of SEQ ID NO:17.

The invention also features nucleic acid molecules that hybridize under stringent
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:21 or
SEQ ID NO:22, an EpTm257 cDNA of ATCC® Accession Number 207217, or a
complement thereof, and encode a polypeptide comprising the amino acid sequence of
SEQ ID NO:23, or a polypeptide comprising at least 360, 370, 380, 390, or 400
contiguous amino acids of SEQ ID NO:23.

The invention also features nucleic acid molecules that hybridize under stringent
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:26 or
SEQ ID NO:27, an EpT258 cDNA of ATCC® Accession Number 207222, or a
complement thereof. In other embodiments, the nucleic acid molecules are at least 550,
600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous
nucleotides in length and hybridize under stringent conditions to a nucleic acid molecule
comprising the nucleotide sequence of SEQ ID NO:26 or SEQ ID NO:27, an EpT258
cDNA of ATCC® Accession Number 207222, or a complement thereof.

The invention also features nucleic acid molecules that hybridize under stringent
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or
SEQ ID NO:38, an EpTm258 cDNA of ATCC® Accession Number 207221, or a
complement thereof. In other embodiments, the nucleic acid molecules are at least 650,
700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous
nucleotides in length and hybridize under stringent conditions to a nucleic acid molecule
comprising the nucleotide sequence of SEQ ID NO:37 or SEQ ID NO:38, an EpTm258
cDNA of ATCC® Accession Number 207221, or a complement thereof.

The invention also features nucleic acid molecules that hybridize under stringent
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:46 or
47, an EpT281 cDNA of ATCC® Accession Number 207222, or a complement thereof.
In other embodiments, the nucleic acid molecules are at least 710, 750, 800, 900, 1000,
1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous nucleotides in length and
hybridize under stringent conditions to a nucleic acid molecule comprising the nucleotide
sequence of SEQ ID NO:46 or SEQ ID NO:47, an EpT281 cDNA of ATCC® Accession
Number 207222, or a complement thereof.

The invention also features nucleic acid molecules that hybridize under stringent
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:56 or
57, an EpTm281 cDNA of ATCC® patent deposit Number PTA-224, or a complement
thereof. In other embodiments, the nucleic acid molecules are at least 580, 600, 700, 800,
900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800 or 1850 contiguous
nucleotides in length and hybridize under stringent conditions to a nucleic acid molecule
comprising the nucleotide sequence of SEQ ID NO:56 or SEQ ID NO:57, an EpTm281
cDNA of ATCC® patent deposit Number PTA-224, or a complement thereof.

The invention also features nucleic acid molecules that hybridize under stringent
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:1, 2,
8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 101, 103, 105, 107, 109, 111, 113,
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149,
151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175,
176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or a
nucleotide sequence of EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258,
EpT281 or EpTm281 of ATCC® Accession Number 207222, Accession Number 207215,
Accession Number 207217, Accession Number 207221, patent deposit Number PTA-224,
or a complement thereof, wherein such nucleic acid molecules encode polypeptides or
proteins that exhibit at least one structural and/or functional feature of a polypeptide of the
invention.

The invention also features nucleic acid molecules at least 15, preferably at least
50, at least 75, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350,
at least 400, at least 500, at least 600, at least 700, at least 800, at least 1000, at least 1100
or at least 1200 or more contiguous nucleotides in length which hybridize under stringent
conditions to a nucleic acid molecule comprising the nucleotide sequence of SEQ ID
NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 104,
105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139,
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170,
171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188,
189, 190, 191 or 192, or a nucleotide sequence of EpT253, EpTm253, EpT257, EpTm257,
EpT258, EpTm258, EpT281 or EpTm281 of ATCC® Accession Number 207222,
Accession Number 207215, Accession Number 207217, Accession Number 207221,
patent deposit Number PTA-224, or a complement thereof, wherein said nucleic acid
molecules encode polypeptides or proteins that exhibit at least one structural and/or
functional feature of a polypeptide of the invention.

In one embodiment, the invention provides an isolated nucleic acid molecule 
which is antisense to the coding strand of a nucleic acid of the invention. 

Another aspect of the invention provides vectors, e.g., recombinant expression
vectors, comprising a nucleic acid molecule of the invention. In another embodiment, the
invention provides host cells containing such a vector or engineered to contain and/or
express a nucleic acid molecule of the invention. The invention also provides methods for
producing a polypeptide of the invention by culturing, in a suitable medium, a host cell of
the invention such that a polypeptide of the invention is produced.

Another aspect of this invention features isolated or recombinant proteins and
polypeptides of the invention. Preferred proteins and polypeptides possess at least one
biological activity possessed by the corresponding naturally-occurring human polypeptide.
An activity, a biological activity, or a functional activity of a polypeptide or nucleic acid
of the invention refers to an activity exerted by a protein, polypeptide or nucleic acid
molecule of the invention on a responsive cell as determined in vivo or in vitro, according
to standard techniques. Such activities can be a direct activity, such as an association with
or an enzymatic activity on a second protein, or an indirect activity, such as a cellular
signaling activity mediated by interaction of the protein with a second protein.

For TANGO 253, biological activities include, e.g., (1) the ability to modulate (this
term, as used herein, includes, but is not limited to, "stabilize", promote, inhibit or disrupt)
protein-protein interactions (e.g., homophilic and/or heterophilic), and protein-ligand
interactions, e.g., in receptor-ligand recognition; (2) the ability to modulate the
development, differentiation, maturation, proliferation and/or activity of cells of the
central nervous system such as neurons, glial cells (e.g., astrocytes and oligodendrocytes),
and Schwann cells; (3) the ability to modulate the development of the central nervous system;
(4) the ability to modulate the development, differentiation, maturation, proliferation
and/or activity of renal cells; (5) the ability to modulate the development, differentiation,
maturation, proliferation and/or activity of testicular cells, such as germ cells, Leydig cells
and Sertoli cells; (6) the ability to modulate the development, differentiation, maturation,
proliferation and/or activity of ovarian cells; (7) the ability to modulate cell-cell interactions
and/or cell-extracellular matrix interactions; (8) the ability to modulate the host immune
response, e.g., by modulating one or more elements in the serum complement cascade; (9)
the ability to modulate the proliferation, differentiation and/or activity of cells that form
blood vessels and coronary tissue (e.g., coronary smooth muscle cells and/or blood vessel
endothelial cells); (10) the ability to modulate intracellular signaling cascades (e.g., signal
transduction cascades); and (11) the ability to modulate adipocyte function.

For TANGO 257, biological activities include, e.g., (1) the ability to modulate the
development, differentiation, proliferation and/or activity of neuronal cells, e.g., olfactory
neurons; (2) the ability to modulate the development, differentiation, proliferation and/or
activity of pulmonary system cells, e.g., lung cell types; (4) the ability to modulate the
development, differentiation, maturation, proliferation and/or activity of bone cells such as
osteocytes, osteoblasts and osteoclasts (e.g., the ability to promote the development of
osteocytes); (5) the ability to modulate the development of bone structures such as the
skull, the basisphenoid bone, the upper and lower incisor teeth, the vertebral column, the
sternum, the scapula, and the femur during embryogenesis; (6) the ability to modulate the
development, differentiation, maturation, proliferation and/or activity of renal cells; (7) the
ability to modulate the development, differentiation, maturation, proliferation and/or
activity of intestinal cells such as M cells; (8) the ability to modulate cell-cell interactions
and/or cell-extracellular matrix interactions, e.g., neuronal cell-extracellular matrix
interactions; (9) the ability to modulate cell proliferation, e.g., abnormal cell proliferation;
and (10) the ability to modulate the development, differentiation, proliferation and/or
activity of cells that form blood vessels and coronary tissue, e.g., coronary smooth muscle
cells and/or blood vessel endothelial cells.

For INTERCEPT 258, biological activities include, e.g., (1) the ability to modulate
protein-protein interactions (e.g., homophilic and/or heterophilic), and protein-ligand
interactions, e.g., in receptor-ligand recognition; (2) the ability to modulate cell-cell
interactions; (3) the ability to modulate the host immune response; (4) the ability to
modulate the development, differentiation, maturation, proliferation and/or activity of
pulmonary system cells such as bronchial cells; (5) the ability to modulate the
development, differentiation, maturation, proliferation and/or activity of renal cells; (6) the
ability to modulate the development, differentiation, maturation, proliferation and/or
activity of cardiac cells such as cardiac myocytes; (7) the ability to modulate the
development of brown fat (e.g., the promotion of the development of brown fat); (8) the
ability to modulate the development, differentiation, maturation, proliferation and/or
activity of endothelial cells; (9) the ability to modulate cell proliferation, e.g.,
gastrointestinal tract epithelial cell proliferation; (10) the ability to modulate intracellular
signaling cascades (e.g., signal transduction cascades); and (11) the ability to modulate
thrombosis (e.g., the ability to facilitate the removal of blood clots) and/or vascularization
(e.g., the promotion of vascularization).

For TANGO 281, biological activities include, e.g., (1) the ability to modulate,
e.g., stabilize, promote, inhibit or disrupt protein-protein interactions (e.g., homophilic
and/or heterophilic), and protein-ligand interactions, e.g., in receptor-ligand recognition;
(2) the ability to modulate cell-cell interactions; (3) the ability to modulate the host
immune response; (4) the ability to modulate the proliferation, differentiation and/or
activity of hematopoietic cells (e.g., megakaryocytes); (5) the ability to modulate the
development, differentiation, maturation, proliferation and/or activity of pulmonary
system cells; (6) the ability to modulate the development, differentiation, maturation,
proliferation and/or activity of intestinal cells such as M cells; (7) the ability to modulate the
development, differentiation, maturation, proliferation and/or activity of stomach cells
such as cells of the gastric epithelium; (8) the ability to modulate intracellular signaling
cascades (e.g., signal transduction cascades); and (9) the ability to modulate platelet
function (e.g., the promotion of platelet aggregation).

In one embodiment, a polypeptide of the invention has an amino acid sequence
sufficiently identical to an identified domain of a polypeptide of the invention. As used
herein, the term "sufficiently identical" refers to a first amino acid or nucleotide sequence
which contains a sufficient or minimum number of identical or equivalent (e.g., with a
similar side chain) amino acid residues or nucleotides to a second amino acid or nucleotide
sequence such that the first and second amino acid or nucleotide sequences have or encode
a common structural domain and/or common functional activity. For example, amino acid
or nucleotide sequences which contain or encode a common structural domain having
about 60% identity, preferably 65% identity, more preferably 75%, 85%, 95%, 98% or
more identity are defined herein as sufficiently identical.
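
By way of illustration only, the percent-identity threshold in the preceding paragraph can be sketched as follows. This sketch assumes the two sequences have already been aligned to equal length with "-" marking gaps, and it assumes identity is counted as identical positions divided by alignment length; neither the function names nor that counting convention are taken from this passage, and the "equivalent residue" (similar side chain) alternative is not modeled.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Assumption: inputs are pre-aligned, equal-length strings; '-' marks a gap
    and a gap column never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def is_sufficiently_identical(aligned_a: str, aligned_b: str,
                              threshold: float = 60.0) -> bool:
    """Apply a threshold such as the 'about 60%' figure recited above."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical toy domain sequences, not sequences of the invention.
    print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
    print(is_sufficiently_identical("AAAAA", "AAABB"))
```

The threshold argument can be raised to 65, 75, 85, 95 or 98 to reflect the more preferred levels of identity.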

In one embodiment, a TANGO 253 protein includes at least one or more of the
following domains: a signal sequence, a collagen domain and a C1q domain.

In one embodiment, a TANGO 257 protein includes at least a signal peptide.

In one embodiment, an INTERCEPT 258 protein includes at least one or more of the
following domains: a signal sequence, an extracellular domain, an immunoglobulin (Ig)
domain, a transmembrane domain, and an intracellular or cytoplasmic domain.

In one embodiment, a TANGO 281 protein includes at least one or more of the
following domains: a signal sequence, an extracellular domain, a photosystem II 10 kD
phosphoprotein domain, a transmembrane domain, and an intracellular or cytoplasmic
domain.

The polypeptides of the present invention, or biologically active portions thereof,
can be operably linked to a heterologous amino acid sequence to form fusion proteins.
The invention further features antibodies, such as monoclonal or polyclonal antibodies,
that specifically bind a polypeptide of the invention. In addition, the polypeptides of the
invention or biologically active portions thereof can be incorporated into pharmaceutical
compositions, which optionally include pharmaceutically acceptable carriers.

In another aspect, the present invention provides methods for detecting the
presence, activity or expression of a polypeptide of the invention in a biological sample by
contacting the biological sample with an agent capable of detecting an indicator of the
presence, activity or expression such that the presence, activity or expression of a
polypeptide of the invention is detected in the biological sample.

In another aspect, the invention provides methods for modulating activity of a
polypeptide of the invention comprising contacting a cell with an agent that modulates
(inhibits or stimulates) the activity or expression of a polypeptide of the invention such
that activity or expression in the cell is modulated. In one embodiment, the agent is an
antibody that specifically binds to a polypeptide of the invention.

In another embodiment, the agent modulates expression of a polypeptide of the
invention by modulating transcription, splicing, or translation of an mRNA encoding a
polypeptide of the invention. In yet another embodiment, the agent is a nucleic acid
molecule having a nucleotide sequence that is antisense to the coding strand of an mRNA
encoding a polypeptide of the invention.

The present invention also provides methods to treat a subject having a disorder
characterized by aberrant activity of a polypeptide of the invention or aberrant expression
of a nucleic acid of the invention by administering an agent which is a modulator of the
activity of a polypeptide of the invention or a modulator of the expression of a nucleic acid
of the invention to the subject. In one embodiment, the modulator is a protein of the
invention. In another embodiment, the modulator is a nucleic acid of the invention. In
other embodiments, the modulator is a peptide, peptidomimetic, or other small molecule.

The present invention also provides diagnostic assays for identifying the presence
or absence of a genetic lesion or mutation characterized by at least one of: (i) aberrant
modification or mutation of a gene encoding a polypeptide of the invention; (ii)
mis-regulation of a gene encoding a polypeptide of the invention; and (iii) aberrant
post-translational modification of a polypeptide of the invention, wherein a wild-type form
of the gene encodes a protein having the activity of the polypeptide of the invention.

In another aspect, the invention provides a method for identifying a compound that
binds to or modulates the activity of a polypeptide of the invention. In general, such
methods entail measuring a biological activity of the polypeptide in the presence and
absence of a test compound and identifying those compounds which alter the activity of
the polypeptide.

The invention also features methods for identifying a compound which modulates
the expression of a polypeptide or nucleic acid of the invention by measuring the
expression of the polypeptide or nucleic acid in the presence and absence of the
compound.

In another aspect, the invention provides substantially purified antibodies or
fragments thereof, including human, humanized, chimeric and non-human antibodies or
fragments thereof, which antibodies or fragments specifically bind to a polypeptide
comprising an amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104,
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140,
142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or the amino acid sequence
encoded by the EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or
EpTm281 cDNA insert of the plasmid deposited with the ATCC® as Accession Number
207222, Accession Number 207215, Accession Number 207217, Accession Number
207221, or patent deposit Number PTA-224.

In another aspect, the invention provides substantially purified antibodies or
fragments thereof, including, e.g., human, non-human, chimeric and humanized
antibodies, which antibodies or fragments thereof specifically bind to a polypeptide
comprising at least 15 contiguous amino acids of the amino acid sequence of SEQ ID NO:
3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124,
126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160,
162 or 164, or the amino acid sequence encoded by the EpT253, EpTm253, EpT257,
EpTm257, EpT258, EpTm258, EpT281 or EpTm281 cDNA insert of the plasmid
deposited with the ATCC® as Accession Number 207222, Accession Number 207215,
Accession Number 207217, Accession Number 207221, or patent deposit Number
PTA-224, or a complement thereof.

In another aspect, the invention provides substantially purified antibodies or
fragments thereof, including, e.g., human, non-human, chimeric and humanized
antibodies, which antibodies or fragments thereof specifically bind to a polypeptide
comprising an amino acid sequence at least 95% identical to the amino acid sequence of
SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120,
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156,
158, 160, 162 or 164, or the amino acid sequence encoded by the EpT253, EpTm253,
EpT257, EpTm257, EpT258, EpTm258, EpT281 or EpTm281 cDNA insert of the plasmid
deposited with the ATCC® as Accession Number 207222, Accession Number 207215,
Accession Number 207217, Accession Number 207221, or patent deposit Number
PTA-224, or a complement thereof.

In another aspect, the invention provides substantially purified antibodies or
fragments thereof, including, e.g., human, non-human, chimeric and humanized
antibodies, which antibodies or fragments thereof specifically bind to a polypeptide
encoded by a nucleic acid molecule which hybridizes to the nucleic acid molecule of SEQ
ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103,
104, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137,
139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169,
170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187,
188, 189, 190, 191 or 192 under conditions of hybridization of 6X SSC at 45°C and
washing in 0.2X SSC, 0.1% SDS at 65°C.

Any of the antibodies of the invention can be conjugated to a therapeutic moiety or
to a detectable substance. Non-limiting examples of detectable substances that can be
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a
fluorescent material, a luminescent material, a bioluminescent material, and a radioactive
material.

The invention also provides a kit containing an antibody of the invention
conjugated to a detectable substance, and instructions for use. Still another aspect of the
invention is a pharmaceutical composition comprising an antibody of the invention and a
pharmaceutically acceptable carrier. In preferred embodiments, the pharmaceutical
composition contains an antibody of the invention, a therapeutic moiety, and a
pharmaceutically acceptable carrier.

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 


Brief Description of the Drawings 

FIGURES 1A-1B depict the cDNA sequence of human TANGO 253 (SEQ ID
NO:1) and the predicted amino acid sequence of human TANGO 253 (SEQ ID NO:3).
The open reading frame of SEQ ID NO:1 extends from nucleotide 188 to nucleotide 916
of SEQ ID NO:1 (SEQ ID NO:2).

FIGURE 2 depicts a hydropathy plot of human TANGO 253. Relatively
hydrophobic regions of the protein are above the dashed horizontal line, and relatively
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine
residues (cys) are indicated by short vertical lines just below the hydropathy trace. The
dashed vertical line separates the signal sequence (amino acids 1 to 15 of SEQ ID NO:3;
SEQ ID NO:5) on the left from the mature protein (amino acids 16 to 243 of SEQ ID
NO:3; SEQ ID NO:4) on the right. Below the hydropathy plot, the amino acid sequence
of human TANGO 253 is depicted.

FIGURES 3A-3B depict a cDNA sequence of mouse TANGO 253 (SEQ ID NO:8)
and the predicted amino acid sequence of mouse TANGO 253 (SEQ ID NO:10). The
open reading frame of SEQ ID NO:8 extends from nucleotide 135 to 863 of SEQ ID
NO:8 (SEQ ID NO:9).

FIGURE 4 depicts a hydropathy plot of mouse TANGO 253. Relatively
hydrophobic regions of the protein are shown above the dashed horizontal line, and
relatively hydrophilic regions of the protein are below the dashed horizontal line. The
cysteine residues (cys) are indicated by short vertical lines just below the hydropathy trace.
The dashed vertical line separates the signal sequence (amino acids 1 to 15 of SEQ ID
NO:10; SEQ ID NO:12) on the left from the mature protein (amino acids 16 to 243 of
SEQ ID NO:10; SEQ ID NO:11) on the right. Below the hydropathy plot, the amino acid
sequence of mouse TANGO 253 is depicted.

FIGURE 5 depicts an alignment of the amino acid sequence of human TANGO
253 (SEQ ID NO:3) and the amino acid sequence of mouse TANGO 253 (SEQ ID
NO:10). The alignment demonstrates that the amino acid sequences of human and mouse
TANGO 253 are 93.8% identical. This alignment was performed using the ALIGN
program with a PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4.

FIGURES 6A-6B depict alignments of the amino acid sequence of human
adipocyte complement-mediated protein precursor (SEQ ID NO:20; Swiss Prot Accession
Number Q15848) and the amino acid sequence of human TANGO 253 (SEQ ID NO:3;
6A) or mouse TANGO 253 (SEQ ID NO:10; 6B). 6A shows the amino acid sequences of
human adipocyte complement-mediated protein precursor and human TANGO 253 are
38.7% identical. 6B shows the amino acid sequences of human adipocyte
complement-mediated protein precursor and mouse TANGO 253 are 38.3% identical. These
alignments were performed using the ALIGN alignment program with a PAM120 scoring
matrix, a gap length penalty of 12, and a gap penalty of 4.

FIGURES 7A-7C depict alignments of the nucleotide sequence of human
adipocyte complement-mediated protein precursor (SEQ ID NO:32; GenBank Accession
Number A1417523) and the nucleotide sequence of human TANGO 253 (SEQ ID NO:1).
The nucleotide sequences of human adipocyte complement-mediated protein precursor
and human TANGO 253 are 29.1% identical. These alignments were performed using the
ALIGN alignment program with a PAM120 scoring matrix, a gap length penalty of 12,
and a gap penalty of 4.

FIGURES 8A-8C depict alignments of the nucleotide sequence of human
adipocyte complement-mediated protein precursor (SEQ ID NO:32; GenBank Accession
Number A1417523) and the nucleotide sequence of mouse TANGO 253 (SEQ ID NO:8).
The nucleotide sequences of human adipocyte complement-mediated protein precursor
and mouse TANGO 253 are 30.4% identical. These alignments were performed using the
ALIGN alignment program with a PAM120 scoring matrix, a gap length penalty of 12,
and a gap penalty of 4.

FIGURES 9A-9B depict the cDNA sequence of human TANGO 257 (SEQ ID
NO:15) and the predicted amino acid sequence of human TANGO 257 (SEQ ID NO:17).
The open reading frame of SEQ ID NO:15 extends from nucleotide 88 to nucleotide 1305
of SEQ ID NO:15 (SEQ ID NO:16).

FIGURE 10 depicts a hydropathy plot of human TANGO 257. Relatively
hydrophobic regions of the protein are shown above the dashed horizontal line, and
relatively hydrophilic regions of the protein are below the dashed horizontal line. The
cysteine residues (cys) and potential N-glycosylation sites (Ngly) are indicated by short
vertical lines just below the hydropathy trace. The dashed vertical line separates the signal
sequence (amino acids 1 to 21 of SEQ ID NO:17; SEQ ID NO:19) on the left from the
mature protein (amino acids 22 to 406 of SEQ ID NO:17; SEQ ID NO:18) on the right.
Below the hydropathy plot, the amino acid sequence of human TANGO 257 is depicted.

FIGURES 11A-11B depict a cDNA sequence of mouse TANGO 257 (SEQ ID
NO:21) and the predicted amino acid sequence of mouse TANGO 257 (SEQ ID NO:23).
The open reading frame of SEQ ID NO:21 extends from nucleotide 31 to 1248 of SEQ ID
NO:21 (SEQ ID NO:22).

FIGURE 12 depicts a hydropathy plot of mouse TANGO 257. Relatively
hydrophobic regions of the protein are shown above the dashed horizontal line, and
relatively hydrophilic regions of the protein are below the dashed horizontal line. The
cysteine residues (cys) and potential N-glycosylation sites (Ngly) are indicated by short
vertical lines just below the hydropathy trace. The dashed vertical line separates the signal
sequence (amino acids 1 to 21 of SEQ ID NO:23; SEQ ID NO:25) on the left from the
mature protein (amino acids 22 to 406 of SEQ ID NO:23; SEQ ID NO:24) on the right.
Below the hydropathy plot, the amino acid sequence of mouse TANGO 257 is depicted.
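
As an informal cross-check, the open reading frame coordinates recited for FIGURES 1, 3, 9 and 11 can be compared with the protein lengths recited for FIGURES 2, 4, 10 and 12. The sketch below assumes the nucleotide coordinates are 1-based and inclusive and that the recited span does not include a stop codon; those conventions are inferred from the arithmetic and are not stated in the text, and the names used are hypothetical.

```python
# (ORF start, ORF end, recited protein length in amino acids), as given in the
# figure descriptions. Assumption: 1-based inclusive coordinates, no stop codon.
RECITED_ORFS = {
    "human TANGO 253": (188, 916, 243),
    "mouse TANGO 253": (135, 863, 243),
    "human TANGO 257": (88, 1305, 406),
    "mouse TANGO 257": (31, 1248, 406),
}


def codon_count(start: int, end: int) -> int:
    """Number of whole codons spanned by nucleotides start..end inclusive."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"span {start}-{end} is not a whole number of codons")
    return length // 3


def check_orfs(orfs: dict) -> dict:
    """Map each name to True when the codon count equals the protein length."""
    return {
        name: codon_count(start, end) == residues
        for name, (start, end, residues) in orfs.items()
    }


if __name__ == "__main__":
    for name, consistent in check_orfs(RECITED_ORFS).items():
        print(name, "consistent" if consistent else "inconsistent")
```

Under these assumptions each recited span is a whole number of codons equal to the recited protein length (729 nucleotides for 243 residues, 1218 nucleotides for 406 residues).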

FIGURE 13 depicts an alignment of the amino acid sequence of human TANGO
257 (SEQ ID NO:17) and the amino acid sequence of mouse TANGO 257 (SEQ ID
NO:23). This alignment demonstrates that the amino acid sequences of human and mouse
TANGO 257 are 94.1% identical. This alignment was performed using the ALIGN
program with a PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4.

FIGURE 14 depicts an alignment of the amino acid sequence (SEQ ID NO:43)
encoded by a nucleotide sequence referred to in PCT publication WO 98/39446 as "gene
64", and the amino acid sequence of human TANGO 257 (SEQ ID NO:17). Gene 64
encodes a 353 amino acid residue protein that exhibits homology with the human
extracellular molecule olfactomedin, which is thought to be involved in maintenance,
growth and/or differentiation of chemosensory cilia on the apical dendrites of olfactory
neurons. The polypeptide encoded by gene 64 also exhibits homology to human TANGO
257, which contains 406 amino acids (i.e., an additional 53 amino acids carboxy to residue
353). The amino acid sequences of amino acid residues 1-353 of the gene 64-encoded
polypeptide and human TANGO 257 are identical. As such, the overall amino acid
sequence identity between the full length polypeptide encoded by gene 64, and the
full-length human TANGO 257 polypeptide is approximately 87%. This alignment was
performed using the ALIGN alignment program with a PAM120 scoring matrix, a gap
length penalty of 12, and a gap penalty of 4.
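
The approximately 87% figure is consistent with simple arithmetic on the lengths recited above. The sketch assumes that overall identity is taken as the 353 identical residues divided by the 406-residue length of human TANGO 257, so that the 53 additional carboxy-terminal residues count as non-identical positions; the text does not state this convention, and the function name is hypothetical.

```python
def overall_identity(identical_residues: int, longer_length: int) -> float:
    """Percent identity when the shorter sequence matches the longer one
    exactly and the unmatched remainder is counted as non-identical.

    Assumption: the denominator is the length of the longer polypeptide.
    """
    if not 0 < identical_residues <= longer_length:
        raise ValueError("identical residues must be between 1 and the length")
    return 100.0 * identical_residues / longer_length


if __name__ == "__main__":
    # 353 residues of the gene 64 polypeptide vs. 406 residues of human TANGO 257
    print(round(overall_identity(353, 406), 1))  # 86.9
```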

FIGURES 15A-15D depict an alignment of the nucleotide sequence of gene 64
(SEQ ID NO:66; PCT Publication WO 98/39446) and the nucleotide sequence of human
TANGO 257 (SEQ ID NO:15). The nucleotide sequences of gene 64 and human
TANGO 257 are 93.5% identical. It is noted, however, that among the differences
between the two sequences is a cytosine nucleotide at human TANGO 257 (SEQ ID
NO:15) position 1146 that results in a human TANGO 257 amino acid sequence (SEQ ID
NO:17) of 406 amino acids as opposed to the gene 64 amino acid sequence of only 353
amino acids (SEQ ID NO:43). Alignment of the nucleotide sequence of the gene 64 open
reading frame and that of human TANGO 257 (SEQ ID NO:16) shows that the two
nucleotide sequences are 87.2% identical. These alignments were performed using the
ALIGN program with a PAM220 scoring matrix, a gap length penalty of 12 and a gap
penalty of 4.

FIGURE 16 depicts an alignment of the amino acid sequence of the gene 64-encoded
polypeptide (SEQ ID NO:43) and the amino acid sequence of mouse TANGO 257 (SEQ
ID NO:23). The sequences exhibit an overall amino acid sequence identity of
approximately 81.8%. This alignment was performed using the ALIGN program with a
PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4.

FIGURES 17A-17C depict an alignment of the nucleotide sequence of gene 64
(SEQ ID NO:66) and the nucleotide sequence of mouse TANGO 257 (SEQ ID NO:21).
The two sequences are approximately 76.2% identical. Alignment of the nucleotide
sequence of the gene 64 open reading frame and that of mouse TANGO 257 (SEQ ID
NO:22) shows that the two nucleotide sequences are 77.8% identical. These alignments
were performed using the ALIGN program with a PAM220 scoring matrix, a gap length
penalty of 12 and a gap penalty of 4.

FIGURES 18A-18B depict the cDNA sequence of human INTERCEPT 258 (SEQ
ID NO:26) and the predicted amino acid sequence of INTERCEPT 258 (SEQ ID NO:28).
The open reading frame of SEQ ID NO:26 extends from nucleotide 153 to nucleotide 
1262 of SEQ ID NO:26 (SEQ ID NO:27). 

FIGURE 19 depicts a hydropathy plot of human INTERCEPT 258. Relatively
hydrophobic regions of the protein are above the dashed horizontal line, and relatively
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine 
residues (Cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. Below the hydropathy plot, the amino acid 
sequence of human INTERCEPT 258 is depicted. 

FIGURES 20A-20B depict a cDNA sequence of mouse INTERCEPT 258 (SEQ ID
NO:37) and the predicted amino acid sequence of mouse INTERCEPT 258 (SEQ ID
NO:39). The open reading frame of SEQ ID NO:37 extends from nucleotide 107 to 1288
of SEQ ID NO:37 (SEQ ID NO:38).

FIGURE 21 depicts a hydropathy plot of mouse INTERCEPT 258. Relatively
hydrophobic regions of the protein are shown above the dashed horizontal line, and
relatively hydrophilic regions of the protein are below the dashed horizontal line. The
cysteine residues (cys) and potential N-glycosylation sites (Ngly) are indicated by short
vertical lines just below the hydropathy trace. The dashed vertical line separates the signal
sequence (amino acids 1 to 29 of SEQ ID NO:39; SEQ ID NO:41) on the left from the
mature protein (amino acids 30 to 394 of SEQ ID NO:39; SEQ ID NO:40) on the right.
Below the hydropathy plot, the amino acid sequence of mouse INTERCEPT 258 is
depicted.

FIGURE 22 depicts an alignment of the amino acid sequence of human 
INTERCEPT 258 (SEQ ID NO:28) and the amino acid sequence of mouse INTERCEPT 
258 (SEQ ID NO:39). The alignment demonstrates that the amino acid sequences of 
human and mouse INTERCEPT 258 are 62.8% identical. This alignment was performed
using the ALIGN program with a PAM120 scoring matrix, a gap length penalty of 12 and 
a gap penalty of 4. 

FIGURE 23 depicts an alignment of the amino acid sequence of human A33 
antigen (SEQ ID NO:67; Swiss Prot Accession Number Q99795) and the amino acid
sequence of human INTERCEPT 258 (SEQ ID NO:28). The A33 antigen is a
transmembrane glycoprotein and member of the Ig superfamily that may be a cancer cell
marker. The amino acid sequences of A33 antigen and human INTERCEPT 258 are 23% 
identical. This alignment was performed using the ALIGN alignment program with a 
PAM120 scoring matrix, a gap length penalty of 12, and a gap penalty of 4. 

FIGURES 24A-24D depict an alignment of the nucleotide sequence of human A33
antigen (SEQ ID NO:68; GenBank Accession Number U79725) and the nucleotide
sequence of human INTERCEPT 258 (SEQ ID NO:26). These two nucleotide sequences
are 40.6% identical. The nucleotide sequence of the open reading frame of human A33
antigen and that of human INTERCEPT 258 are 44% identical. These alignments were
performed using the ALIGN alignment program with a PAM120 scoring matrix, a gap
length penalty of 12, and a gap penalty of 4.

FIGURE 25 depicts an alignment of the amino acid sequence of human A33 
antigen (SEQ ID NO:67; Swiss Prot Accession Number Q99795) and the amino acid 
sequence of mouse INTERCEPT 258 (SEQ ID NO:39). These two amino acid sequences
have an overall amino acid identity of 23%. This alignment was performed using the
ALIGN alignment program with a PAM120 scoring matrix, a gap length penalty of 12, 
and a gap penalty of 4. 

FIGURES 26A-26D depict an alignment of the nucleotide sequence of human A33
antigen (SEQ ID NO:68; GenBank Accession Number U79725) and the nucleotide
sequence of mouse INTERCEPT 258 (SEQ ID NO:37). These two nucleotide sequences
are 40% identical. The nucleotide sequence of the open reading frame of human A33
antigen and that of mouse INTERCEPT 258 are 43.2% identical. These alignments were
performed using the ALIGN alignment program with a PAM120 scoring matrix, a gap
length penalty of 12, and a gap penalty of 4.

FIGURES 27A-27E depict an alignment of the nucleotide sequence of human
PECAM-1, an integrin expressed on endothelial cells (SEQ ID NO:72) and the nucleotide
sequence of human INTERCEPT 258 (SEQ ID NO:26). These two nucleotide sequences
are 40.5% identical. This alignment was performed using the ALIGN alignment program
with a PAM120 scoring matrix, a gap length penalty of 12, and a gap penalty of 4.

FIGURES 28A-28B depict the cDNA sequence of human TANGO 281 (SEQ ID
NO:46) and the predicted amino acid sequence of human TANGO 281 (SEQ ID NO:48).
The open reading frame of SEQ ID NO:46 extends from nucleotide 65 to nucleotide 799
of SEQ ID NO:46 (SEQ ID NO:47).

FIGURE 29 depicts a hydropathy plot of human TANGO 281. Relatively
hydrophobic regions of the protein are above the dashed horizontal line, and relatively
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine
residues (cys) are indicated by short vertical lines just below the hydropathy trace. The
dashed vertical line separates the signal sequence (amino acids 1 to 38 of SEQ ID NO:48;
SEQ ID NO:49) on the left from the mature protein (amino acids 39 to 245 of SEQ ID
NO:48; SEQ ID NO:50) on the right. Below the hydropathy plot, the amino acid sequence
of human TANGO 281 is depicted.

FIGURE 30 depicts an alignment of the amino acid sequence of photosystem II 10 
kD phosphoprotein domain (SEQ ID NO:69; GenBank Accession Number PF00737) and 
the amino acid sequence 97 to 146 of human TANGO 281 (SEQ ID NO:48). This 
alignment was performed using the ALIGN alignment program with a PAM120 scoring
matrix, a gap length penalty of 12, and a gap penalty of 4.

FIGURES 31A-31B depict the cDNA sequence of mouse TANGO 281 (SEQ ID 
NO:56) and the predicted amino acid sequence of mouse TANGO 281 (SEQ ID NO:58). 
The open reading frame of SEQ ID NO:56 extends from nucleotide 90 to nucleotide 728 
of SEQ ID NO:56 (SEQ ID NO:57). 

FIGURE 32 depicts a hydropathy plot of mouse TANGO 281. Relatively
hydrophobic regions of the protein are above the dashed horizontal line, and relatively
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine
residues (cys) are indicated by short vertical lines just below the hydropathy trace. The
dashed vertical line separates the signal sequence (amino acids 1 to 26 of SEQ ID NO:58;
SEQ ID NO:59) on the left from the mature protein (amino acids 27 to 213 of SEQ ID
NO:58; SEQ ID NO:60) on the right. Below the hydropathy plot, the amino acid sequence
of mouse TANGO 281 is depicted.

FIGURE 33 depicts an alignment of the amino acid sequence of human TANGO 
281 (SEQ ID NO:48) and the amino acid sequence of mouse TANGO 281 (SEQ ID 
NO:58). The alignment demonstrates that the amino acid sequences of human and mouse
TANGO 281 are 66.5% identical. This alignment was performed using the ALIGN
program with a PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4. 

Detailed Description of the Invention 

The TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 proteins and
nucleic acid molecules comprise families of molecules having certain conserved structural
and functional features. As used herein, the terms "family" or "families" are intended to
mean two or more proteins or nucleic acid molecules having a common structural domain
and having sufficient amino acid or nucleotide sequence identity as defined herein.
Family members can be from either the same or different species. For example, a family
can comprise two or more proteins of human origin, or can comprise one or more
proteins of human origin and one or more of non-human origin. Members of the same 
family may also have common structural domains. 

For example, TANGO 253 proteins, TANGO 257 proteins, INTERCEPT 258
proteins and TANGO 281 proteins of the invention have signal sequences. As used
herein, a "signal sequence" includes a peptide of at least about 15 or 20 amino acid
residues in length which occurs at the N-terminus of secretory and membrane-bound
proteins and which contains at least about 70% hydrophobic amino acid residues such as
alanine, leucine, isoleucine, phenylalanine, proline, tyrosine, tryptophan, or valine. In a
preferred embodiment, a signal sequence contains at least about 10 to 40 amino acid
residues, preferably about 19-34 amino acid residues, and has at least about 60-80%, more
preferably 65-75%, and more preferably at least about 70% hydrophobic residues. A
signal sequence serves to direct a protein containing such a sequence to a lipid bilayer.
Thus, in one embodiment, a TANGO 253 protein contains a signal sequence of about
amino acids 1 to 15 of SEQ ID NO:3 (SEQ ID NO:5) or about amino acids 1 to 15 of SEQ
ID NO:10 (SEQ ID NO:12). In another embodiment, a TANGO 257 protein contains a
signal sequence of about amino acids 1 to 21 of SEQ ID NO:17 (SEQ ID NO:19) or about
amino acids 1 to 21 of SEQ ID NO:23 (SEQ ID NO:25). In another embodiment, an
INTERCEPT 258 protein contains a signal sequence at about amino acids 1 to 29 of SEQ
ID NO:28 (SEQ ID NO:30) or about amino acids 1 to 29 of SEQ ID NO:39 (SEQ ID
NO:41). In yet another embodiment, a TANGO 281 protein contains a signal sequence of
about amino acids 1 to 38 of SEQ ID NO:48 (SEQ ID NO:49) or about amino acids 1 to
26 of SEQ ID NO:58 (SEQ ID NO:59). The signal sequence is cleaved during processing
of the mature protein.
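
The length and hydrophobicity criteria in the definition above can be sketched as a simple check. The following Python example is illustrative only: it uses the eight residues named in the definition, a minimum length of 15 and a 70% threshold as defaults, and a made-up peptide rather than any sequence of the invention. It is not a signal peptide predictor.

```python
# One-letter codes for the residues named in the definition: alanine, leucine,
# isoleucine, phenylalanine, proline, tyrosine, tryptophan and valine.
HYDROPHOBIC = set("ALIFPYWV")


def hydrophobic_fraction(peptide: str) -> float:
    """Fraction of residues in the peptide that are in HYDROPHOBIC."""
    peptide = peptide.upper()
    if not peptide:
        return 0.0
    return sum(1 for residue in peptide if residue in HYDROPHOBIC) / len(peptide)


def meets_signal_sequence_criteria(
    peptide: str, min_length: int = 15, min_fraction: float = 0.70
) -> bool:
    """True if the peptide satisfies the length and hydrophobicity thresholds."""
    return len(peptide) >= min_length and hydrophobic_fraction(peptide) >= min_fraction


# Hypothetical 15-residue peptide used purely for illustration.
print(meets_signal_sequence_criteria("MLLLALLVALPAFLA"))
```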

In one embodiment, TANGO 253 includes at least one RGD cell attachment site.
An RGD domain contains a contiguous arginine-glycine-aspartic acid amino acid
sequence and is involved in cell-cell, cell-extracellular matrix and cell adhesion 
interactions. In a preferred embodiment, a TANGO 253 family member has the amino 
acid sequence of SEQ ID NO:3 and, preferably, a RGD cell attachment site is located at 
about amino acid positions 77 to 79. 

TANGO 253 family members can also include a collagen domain. As used herein,
the term "collagen domain" refers to a protein domain containing a G-X-Y amino acid
repeat motif, wherein the first amino acid residue is glycine and the second and third
amino acid residues can be any residue but are preferably proline or hydroxyproline.
Typically, a collagen domain contains at least about 3 to 5 G-X-Y repeats, and can contain
about 3, 5, 8, 10, 12, 15, 20 or more continuous G-X-Y repeats. In one embodiment, a
collagen domain can fold to form a triple helical structure. 
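
The G-X-Y repeat criterion can be illustrated with a short Python sketch that reports the longest run of contiguous triplets beginning with glycine. The function name and the toy sequences are hypothetical; a one-letter sequence cannot distinguish hydroxyproline from proline, so the sketch checks only the glycine position.

```python
def longest_gxy_run(sequence: str) -> int:
    """Length, in repeats, of the longest run of contiguous G-X-Y triplets.

    Every start position is tried, so the run may begin in any frame.
    Only complete triplets whose first residue is glycine are counted.
    """
    sequence = sequence.upper()
    best = 0
    for start in range(len(sequence)):
        count = 0
        i = start
        while i + 3 <= len(sequence) and sequence[i] == "G":
            count += 1
            i += 3
        best = max(best, count)
    return best


# Hypothetical fragment with three contiguous G-X-Y repeats.
print(longest_gxy_run("GPPGAPGKP"))
```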

In one embodiment, a TANGO 253 family member includes at least one collagen 
domain having an amino acid sequence that is at least about 40%, 50%, 60%, 70%, 80%, 
90%, 95% or 98% identical to amino acids 36 to 95 of SEQ ID NO:3, which is the
collagen domain of human TANGO 253 (SEQ ID NO:6), or amino acids 36 to 95 of SEQ
ID NO:10, which is the collagen domain of mouse TANGO 253 (SEQ ID NO:13), while
maintaining a glycine residue at the first position of G-X-Y repeats within the domain to
maintain at least 3, 5, 8, 10, 12, 15 or 20 contiguous G-X-Y repeats, or while most
preferably maintaining a glycine residue at the first position of each G-X-Y repeat within
the domain.

TANGO 253 family members can also include a C1q domain or at least one of the
conserved amino acid motifs found therein. As used herein, the term "C1q domain" refers
to a protein domain that bears homology to a C1q domain present within a member of the
C1 enzyme complex. A C1q domain typically includes about 130-140 amino acid
residues. C1q domains are utilized in processes involving, e.g., correct protein folding and
alignment and protein-protein interactions.

In one embodiment, a TANGO 253 family member includes one or more C1q
domains having an amino acid sequence that is at least 45%, preferably about 50%, 55%,
60%, 70%, 75%, 80%, 90%, 95% and most preferably at least about 98% identical to
amino acids 105 to 232 of SEQ ID NO:3, which is the human TANGO 253 C1q domain
(SEQ ID NO:7) or amino acids 105 to 232 of SEQ ID NO:10, which is the mouse
TANGO 253 C1q domain (SEQ ID NO:14).

Embodiments of TANGO 253 family members include, but are not limited to,
human, mouse and rat TANGO 253 nucleic acids and proteins. The features of the human
and mouse TANGO 253 are described below. A cDNA encoding a rat TANGO 253
nucleotide sequence (SEQ ID NO:74), identified in clone jtrxaOO lei Otl, is 75.4%
identical to human TANGO 253 (SEQ ID NO:1) in a 536 bp overlap. Further, the isolated
rat TANGO 253 nucleotide sequence (SEQ ID NO:74) is 86% identical to mouse TANGO
253 (SEQ ID NO:9) in a 472 bp overlap.

Embodiments of TANGO 257 family members include, but are not limited to,
human, mouse and rat TANGO 257 nucleic acids and proteins. The features of the human
and mouse TANGO 257 are described below. A cDNA encoding a rat TANGO 257
nucleotide sequence (SEQ ID NO:75), identified within clone jtrxal02g06tl, is 83.8%
identical to human TANGO 257 (SEQ ID NO:15) in a 734 bp overlap. Further, the
isolated rat TANGO 257 nucleotide sequence (SEQ ID NO:75) is 88.4% identical to
mouse TANGO 257 (SEQ ID NO:21) in a 731 bp overlap. 

In one example, a TANGO 257 family member includes one or more of the 
following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a 
cytoplasmic domain. In one embodiment, a TANGO 257 protein contains cytoplasmic
domains of about amino acid residues 1 to 202 of SEQ ID NO:17 (SEQ ID NO:84) and about
amino acid residues 338 to 406 of SEQ ID NO:17 (SEQ ID NO:92), transmembrane
domains of about amino acid residues 203 to 221 of SEQ ID NO:17 (SEQ ID NO:86) and
about amino acid residues 321 to 337 of SEQ ID NO:17 (SEQ ID NO:87), and an
extracellular domain of about amino acid residues 222 to 320 of SEQ ID NO:17 (SEQ ID
NO:88). In an alternative embodiment, a TANGO 257 protein contains an extracellular
domain of about amino acid residues 1 to 320 of SEQ ID NO:17 (SEQ ID NO:89) or a
mature extracellular domain of about amino acid residues 22 to 320 of SEQ ID NO:17
(SEQ ID NO:90), a transmembrane domain of about amino acid residues 321 to 337 of
SEQ ID NO:17 (SEQ ID NO:87), and a cytoplasmic domain of about amino acid residues
338 to 406 of SEQ ID NO:17 (SEQ ID NO:92). In another embodiment, a mature
TANGO 257 protein contains about amino acid residues 22 to 406 of SEQ ID NO:17
(SEQ ID NO:18).
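
As a consistency check on residue ranges of this kind, the sketch below verifies that the five domains of the first embodiment cover residues 1 to 406 without gaps or overlaps, and shows how 1-based inclusive coordinates map onto Python slicing. The variable and function names are hypothetical, and the domain boundaries are approximate ("about") in the text.

```python
# Domain boundaries (1-based, inclusive) as recited for human TANGO 257
# in the first embodiment above.
HUMAN_TANGO_257_DOMAINS = [
    ("cytoplasmic", 1, 202),
    ("transmembrane", 203, 221),
    ("extracellular", 222, 320),
    ("transmembrane", 321, 337),
    ("cytoplasmic", 338, 406),
]


def tiles_protein(domains, protein_length: int) -> bool:
    """True if the domains cover 1..protein_length with no gaps or overlaps."""
    expected_start = 1
    for _name, start, end in sorted(domains, key=lambda d: d[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Extract residues start..end (1-based, inclusive) from a sequence."""
    return sequence[start - 1:end]


print(tiles_protein(HUMAN_TANGO_257_DOMAINS, 406))
```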

In another embodiment, a TANGO 257 protein contains intracellular domains of
about amino acid residues 1 to 202 of SEQ ID NO:23 (SEQ ID NO:93) and about amino
acid residues 338 to 406 of SEQ ID NO:23 (SEQ ID NO:94), transmembrane domains of
about amino acid residues 203 to 221 of SEQ ID NO:23 (SEQ ID NO:95) and about
amino acid residues 321 to 337 of SEQ ID NO:23 (SEQ ID NO:96), and an extracellular
domain of about amino acid residues 222 to 320 of SEQ ID NO:23 (SEQ ID NO:97). In an
alternative embodiment, a TANGO 257 protein contains an extracellular domain of about
amino acid residues 1 to 320 of SEQ ID NO:23 (SEQ ID NO:98) or a mature extracellular
domain of about amino acid residues 22 to 320 of SEQ ID NO:23 (SEQ ID NO:99), a
transmembrane domain of about amino acid residues 321 to 337 of SEQ ID NO:23 (SEQ
ID NO:96), and an intracellular domain of about amino acid residues 338 to 406 of SEQ
ID NO:23 (SEQ ID NO:94). In another embodiment, a mature TANGO 257 protein
contains about amino acid residues 22 to 406 of SEQ ID NO:23 (SEQ ID NO:24).

In another example, an INTERCEPT 258 family member includes one or more of
the following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3)
a cytoplasmic domain. Thus, in one embodiment, an INTERCEPT 258 protein contains
extracellular domains of about amino acid residues 1 to 206 of SEQ ID NO:28 (SEQ ID
NO:81) or about amino acid residues 30 to 206 of SEQ ID NO:28 (SEQ ID NO:76) and
about amino acid residues 272 to 370 of SEQ ID NO:28 (SEQ ID NO:34),
transmembrane domains of about amino acid residues 207 to 224 of SEQ ID NO:28 (SEQ
ID NO:78) and about amino acid residues 247 to 271 of SEQ ID NO:28 (SEQ ID NO:33),
and a cytoplasmic domain of about amino acid residues 225 to 246 of SEQ ID NO:28
(SEQ ID NO:79). In an alternative embodiment, an INTERCEPT 258 protein contains an
extracellular domain of about amino acid residues 272 to 370 of SEQ ID NO:28 (SEQ ID
NO:34), a transmembrane domain of about amino acid residues 247 to 271 of SEQ ID
NO:28 (SEQ ID NO:33), and a cytoplasmic domain of about amino acid residues 1 to 246
of SEQ ID NO:28 (SEQ ID NO:31) or a mature cytoplasmic domain of about amino acid
residues 30 to 246 of SEQ ID NO:28 (SEQ ID NO:82). In accordance with these
embodiments, an INTERCEPT 258 protein is a mature protein containing an extracellular,
transmembrane and cytoplasmic domain of about amino acids 30 to 370 of SEQ ID NO:28
(SEQ ID NO:29).

In another embodiment, an INTERCEPT 258 protein contains an extracellular
domain of about amino acids 1 to 249 of SEQ ID NO:39 (SEQ ID NO:42), or a mature
extracellular domain of about amino acids 30 to 249 of SEQ ID NO:39 (SEQ ID NO:83).
In another embodiment, an INTERCEPT 258 protein contains a transmembrane domain of
about amino acids 250 to 274 of SEQ ID NO:39 (SEQ ID NO:44). In another
embodiment, an INTERCEPT 258 protein contains a cytoplasmic domain of about amino
acids 275 to 394 of SEQ ID NO:39 (SEQ ID NO:45). In accordance with these
embodiments, an INTERCEPT 258 protein is a mature protein containing an extracellular,
transmembrane and cytoplasmic domain of about amino acids 30 to 394 of SEQ ID NO:39
(SEQ ID NO:40).

INTERCEPT 258 family members can also include an immunoglobulin (Ig)
domain contained within the extracellular domain. As used herein, the term "Ig domain"
refers to a protein domain bearing homology to immunoglobulin superfamily members.
An Ig domain includes about 30-90 amino acid residues, preferably about 40-80 amino
acid residues, more preferably about 50-70 amino acid residues, still more preferably
about 55-65 amino acid residues, and most preferably about 57 to 59 amino acid residues.
In certain embodiments, an Ig domain contains a conserved cysteine residue within about
5 to 15 amino acid residues, preferably about 7 to 12 amino acid residues, and most
preferably about 8 amino acid residues from its N-terminal end, and another conserved
cysteine residue within about 1 to 5 amino acid residues, preferably about 2 to 4 amino
acid residues, and most preferably about 3 amino acid residues from its C-terminal end.
An Ig domain typically has the following consensus sequence, beginning about 1
to 15 amino acid residues, more preferably about 3 to 10 amino acid residues, and most
preferably about 5 amino acid residues from the C terminal end of the domain: (FY)-Xaa-
C-Xaa-(VA)-COO-, wherein (FY) is either a phenylalanine or a tyrosine residue
(preferably tyrosine), where "Xaa" is any amino acid, C is a cysteine residue, (VA) is
either a valine or an alanine residue (preferably alanine), and COO- is the protein C
terminus.
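
The five-residue core of the consensus, (FY)-Xaa-C-Xaa-(VA), can be expressed as a regular expression. The Python sketch below scans a sequence for that core only; it does not enforce the stated distance from the C-terminal end of the domain, and the function name and toy sequences are hypothetical.

```python
import re

# (FY)-Xaa-C-Xaa-(VA): F or Y, any residue, C, any residue, V or A.
# A lookahead is used so that overlapping matches are also reported.
IG_CONSENSUS = re.compile(r"(?=([FY].C.[VA]))")


def find_ig_consensus(sequence: str):
    """Return (1-based start position, matched residues) for each occurrence."""
    return [
        (match.start() + 1, match.group(1))
        for match in IG_CONSENSUS.finditer(sequence.upper())
    ]


# Hypothetical fragment used purely for illustration.
print(find_ig_consensus("AAYSCKVAA"))
```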

In one embodiment, an INTERCEPT 258 family member includes one or more Ig 
domains having an amino acid sequence that is at least about 55%, preferably at least 
about 65%, more preferably at least 75%, yet more preferably at least about 85%, and 
most preferably at least about 95% identical to amino acids 49 to 128 and/or amino acids
167 to 226 of SEQ ID NO:28, which are the Ig domains of human INTERCEPT 258
(these Ig domains are also represented as SEQ ID NO:35 and 36, respectively). 

In another embodiment, an INTERCEPT 258 family member includes one or more
Ig domains having an amino acid sequence that is at least about 55%, preferably at least
about 65%, more preferably at least about 75%, yet more preferably at least about 85%,
and most preferably at least about 95% identical to amino acids 167 to 226 of SEQ ID
NO:28 (SEQ ID NO:36), includes a conserved cysteine residue about 8 residues
downstream from the N-terminus of the Ig domain, and has one or more Ig domain
consensus sequences described herein. In another embodiment, an INTERCEPT 258
family member includes one or more Ig domains having an amino acid sequence that is at
least 55%, preferably at least about 65%, more preferably at least about 75%, yet more
preferably at least about 85%, and most preferably at least about 95% identical to amino
acids 167 to 226 of SEQ ID NO:28 (SEQ ID NO:36), includes a conserved cysteine
residue 8 residues downstream from the N-terminus of the Ig domain, has one or more Ig
domain consensus sequences described herein, and has a conserved cysteine within the
consensus sequence that forms a disulfide bond with said first conserved cysteine. In yet
another embodiment, an INTERCEPT 258 family member includes one or more Ig
domains having an amino acid sequence that is at least 55%, preferably at least about 65%,
more preferably at least about 75%, yet more preferably at least about 85%, and most
preferably at least about 95% identical to amino acids 167 to 226 of SEQ ID NO:28 (SEQ
ID NO:36), includes a conserved cysteine residue 8 residues downstream from the
N-terminus of the Ig domain, has one or more Ig domain consensus sequences described
herein, has a conserved cysteine within the consensus sequence that forms a disulfide bond
with said first conserved cysteine, and has at least one INTERCEPT 258 biological
activity as described herein.

In a preferred embodiment, an INTERCEPT 258 family member has the amino
acid sequence of SEQ ID NO:28 wherein the aforementioned Ig conserved residues are
located as follows: the N-terminal conserved cysteine residue is located at about amino 
acid position 174 (within the Ig domain SEQ ID NO:36) and the C-terminal conserved 
cysteine is located at about amino acid position 224 (within the Ig domain SEQ ID 
NO:36). 

In another embodiment, an INTERCEPT 258 family member includes one or more
Ig domains having an amino acid sequence that is at least about 55%, preferably at least
about 65%, more preferably at least about 75%, yet more preferably at least about 85%,
and most preferably at least about 95% identical to amino acids 170 to 229 of SEQ ID
NO:39, which is the Ig domain of mouse INTERCEPT 258 (SEQ ID NO:71). In another
embodiment, an INTERCEPT 258 family member includes one or more Ig domains
having an amino acid sequence that is at least about 55%, preferably at least about 65%,
more preferably at least about 75%, yet more preferably at least about 85%, and most
preferably at least about 95% identical to amino acids 170 to 229 of SEQ ID NO:39 (SEQ
ID NO:71), includes a conserved cysteine residue about 8 residues downstream from the
N-terminus of the Ig domain, has one or more Ig domain consensus sequences
described herein, has a conserved cysteine within the consensus sequence that forms a
disulfide bond with said first conserved cysteine, and has at least one INTERCEPT 258
biological activity as described herein.

In a preferred embodiment, an INTERCEPT 258 family member has the amino
acid sequence of SEQ ID NO:39 wherein the aforementioned Ig domain conserved
residues are located as follows: the N-terminal conserved cysteine residue is located at 






WO 00/78808 



PCT/US00/16883 



about amino acid residue position 177 (within the Ig domain SEQ ID NO:71) and the C- 
terminal conserved cysteine residue is located at about amino acid position 227 (within the 
Ig domain SEQ ID NO:71). 

In another example, a TANGO 281 family member consists of one or more of the
following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a
cytoplasmic domain. In one embodiment, a TANGO 281 protein contains an extracellular
domain at amino acids 1 to about 123 of SEQ ID NO:48 or a mature extracellular domain
at about amino acid residues 39 to 123 of SEQ ID NO:48 (SEQ ID NO:51), a
transmembrane domain at about amino acid residues 124 to 148 of SEQ ID NO:48 (SEQ
ID NO:52), and a cytoplasmic domain at about amino acid residues 149 to 245 of SEQ ID
NO:48 (SEQ ID NO:53). In another embodiment, a mature TANGO 281 protein contains
about amino acid residues 39 to 245 of SEQ ID NO:48 (SEQ ID NO:50). In another
embodiment, a TANGO 281 family member contains an extracellular domain at amino
acids 1 to about 112 of SEQ ID NO:58 or a mature extracellular domain at about amino
acid residues 27 to 112 of SEQ ID NO:58 (SEQ ID NO:61), a transmembrane domain at
about amino acid residues 113 to 137 of SEQ ID NO:58 (SEQ ID NO:62), and a
cytoplasmic domain at about amino acid residues 138 to 213 of SEQ ID NO:58 (SEQ ID
NO:63). In yet another embodiment, a mature TANGO 281 protein contains about amino
acid residues 27 to 213 of SEQ ID NO:58 (SEQ ID NO:60).

In one embodiment, a TANGO 281 family member includes a signal sequence. In
a preferred embodiment, a TANGO 281 family member has the amino acid sequence of
SEQ ID NO:48, and the signal sequence is located at about amino acids 1 to 38. In
another preferred embodiment, a TANGO 281 family member has the amino acid
sequence of SEQ ID NO:58, and the signal sequence is located at about amino acids 1 to
26.

A photosystem II 10 kD phosphoprotein (PSBH) domain has been identified in the
TANGO 281 proteins. The domain is also present in the chloroplast gene PSBH that
encodes a 9-10 kDa thylakoid membrane protein (PSII-H) which is associated with
photosystem II. In one embodiment, a TANGO 281 family member includes one or more
PSBH domains having an amino acid sequence that is at least about 55%, preferably at
least about 65%, more preferably at least 75%, yet more preferably at least about 85%, and
most preferably at least about 95% identical to amino acids 41 to 90 and/or amino acids
127 to 182 of SEQ ID NO:48, which are the PSBH domains of human TANGO 281 (these
PSBH domains are also represented as SEQ ID NO:54 and 55, respectively). In another
embodiment, a TANGO 281 family member includes one or more PSBH domains having
an amino acid sequence that is at least about 55%, preferably at least about 65%, more
preferably at least about 75%, yet more preferably at least about 85%, and most preferably
at least about 95% identical to amino acids 41 to 90 and/or amino acids 127 to 182 of SEQ
ID NO:48, which are the PSBH domains of human TANGO 281 (these PSBH domains are
also represented as SEQ ID NO:54 and 55, respectively), includes one or more PSBH
domain consensus sequences described herein, and has at least one TANGO 281
biological activity as described herein.

In another embodiment, a TANGO 281 family member includes one or more
PSBH domains having an amino acid sequence that is at least about 55%, preferably at
least about 65%, more preferably at least 75%, yet more preferably at least about 85%, and
most preferably at least about 95% to 98% identical to amino acids 42 to 91 and/or amino
acids 128 to 183 of SEQ ID NO:58, which are the PSBH domains of mouse TANGO 281
(these PSBH domains are also represented as SEQ ID NO:64 and 65, respectively). In
another embodiment, a TANGO 281 family member includes one or more PSBH domains
having an amino acid sequence that is at least about 55%, preferably at least about 65%,
more preferably at least about 75%, yet more preferably at least about 85%, and most
preferably at least about 95% identical to amino acids 42 to 91 and/or amino acids 128 to
183 of SEQ ID NO:58, which are the PSBH domains of mouse TANGO 281 (these PSBH
domains are also represented as SEQ ID NO:64 and 65, respectively), includes one or
more PSBH domain consensus sequences described herein, and has at least one TANGO
281 biological activity as described herein.

Various features of human and mouse TANGO 253, TANGO 257, INTERCEPT 
258 and TANGO 281 are summarized below. 

Human TANGO 253 

A cDNA encoding human TANGO 253 was identified by analyzing the sequences
of clones present in a coronary artery smooth muscle library for sequences that encode
secreted proteins. The primary cells utilized in construction of the library had been
stimulated with agents that included phorbol 12-myristate 13-acetate (PMA), tumor
necrosis factor (TNF), ionomycin, and cycloheximide (CHX). This analysis led to the
identification of a clone, Athma27h9, encoding full-length human TANGO 253. The
human TANGO 253 cDNA of this clone is 1339 nucleotides long (Figures 1A-1B; SEQ
ID NO:1). The open reading frame of this cDNA, nucleotides 188 to 916 of SEQ ID NO:1
(SEQ ID NO:2), encodes a 243 amino acid secreted protein (Figures 1A-1B; SEQ ID
NO:3).

Figure 2 depicts a hydropathy plot of human TANGO 253. Relatively
hydrophobic regions of the protein are shown above the horizontal line, and relatively
hydrophilic regions of the protein are below the horizontal line. The cysteine residues
(cys) are indicated by short vertical lines just below the hydropathy trace. The dashed
vertical line separates the signal sequence (amino acids 1 to 15 of SEQ ID NO:3; SEQ ID
NO:5) on the left from the mature protein (amino acids 16 to 243 of SEQ ID NO:3; SEQ
ID NO:4) on the right.

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein
Engineering 10:1-6) predicted that human TANGO 253 includes a 15 amino acid signal
peptide (amino acid 1 to amino acid 15 of SEQ ID NO:3; SEQ ID NO:5) preceding the
mature human TANGO 253 protein (corresponding to amino acid 16 to amino acid 243 of
SEQ ID NO:3; SEQ ID NO:4). The molecular weight of TANGO 253 protein without
post-translational modifications is 25.3 kDa prior to the cleavage of the signal peptide,
23.8 kDa after cleavage of the signal peptide.
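
The residue ranges of the signal peptide and the mature protein follow directly from the predicted signal peptide length. The following is a minimal sketch, assuming 1-based inclusive residue numbering; the helper name is hypothetical and is used only for illustration.

```python
def split_at_signal_peptide(protein_length: int, signal_length: int):
    """Return 1-based inclusive (start, end) ranges for signal peptide and mature protein."""
    if not 0 < signal_length < protein_length:
        raise ValueError("signal peptide must be shorter than the full protein")
    signal = (1, signal_length)
    mature = (signal_length + 1, protein_length)
    return signal, mature


# Human TANGO 253: 243 residues in total, 15-residue predicted signal peptide
signal, mature = split_at_signal_peptide(243, 15)
mature_length = mature[1] - mature[0] + 1
print(signal, mature, mature_length)
```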

Human TANGO 253 includes a collagen domain (at about amino acids 36 to 95 of
SEQ ID NO:3; SEQ ID NO:6) and a C1q domain (at about amino acids 105 to 232 of SEQ
ID NO:3; SEQ ID NO:7) containing 23 G-X-Y repeats. An RGD cell attachment site is
found at amino acids 77 to 79 of SEQ ID NO:3.

Three protein kinase C phosphorylation sites are present in human TANGO 253.
The first has the sequence SAK (at amino acids 107 to 109 of SEQ ID NO:3), the second
has the sequence TGK (at amino acids 140 to 142 of SEQ ID NO:3), and the third has the
sequence SIK (at amino acids 220 to 222 of SEQ ID NO:3). Human TANGO 253 has
three N-myristylation sites. The first has the sequence GLAAGS (at amino acids 11 to 16
of SEQ ID NO:3), the second has the sequence GGRPGL (at amino acids 68 to 73 of SEQ
ID NO:3) and the third has the sequence GIYASI (at amino acids 216 to 221 of SEQ ID
NO:3).
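
The site sequences listed above can be checked against consensus patterns of the kind used for such predictions. The following is a minimal sketch, assuming PROSITE-style consensus patterns for protein kinase C phosphorylation sites and N-myristylation sites; the patterns and variable names are assumptions introduced here and are not taken from the specification, which does not state which patterns were applied.

```python
import re

# PROSITE-style consensus patterns written as regular expressions
# (assumed for illustration, not recited in the specification).
PKC_SITE = re.compile(r"[ST].[RK]")                      # protein kinase C site
N_MYRISTYLATION = re.compile(r"G[^EDRKHPFYW]..[STAGCN][^P]")  # N-myristylation site

# Site sequences as listed for human TANGO 253
pkc_sites = ["SAK", "TGK", "SIK"]
myristylation_sites = ["GLAAGS", "GGRPGL", "GIYASI"]

# True only if every listed sequence matches its pattern over its full length
pkc_ok = all(PKC_SITE.fullmatch(s) is not None for s in pkc_sites)
myr_ok = all(N_MYRISTYLATION.fullmatch(s) is not None for s in myristylation_sites)
print(pkc_ok, myr_ok)
```

A match of this kind indicates only that a sequence fits the consensus pattern, not that the site is modified in vivo.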

Northern analysis of human TANGO 253 expression demonstrates strong
expression in heart, lung, liver, kidney and pancreas, and moderate expression in brain,
placenta and skeletal muscle. Liver expression reveals two human TANGO 253 mRNA bands,
one of approximately 1.3 kb (which is the size observed in the other tissues) as well as a
band at approximately 1 kb, which may be the result of an alternative splicing event.

Secretion assays reveal a human TANGO 253 protein of approximately 30 kDa.
The secretion assays were performed as follows: 8x10^5 293T cells were plated per well in
a 6-well plate and the cells were incubated in growth medium (DMEM, 10% fetal bovine
serum, penicillin/streptomycin) at 37°C, 5% CO2 overnight. 293T cells were transfected
with 2 ng of full-length TANGO 253 inserted in the pMET7 vector/well and 10 µg
LipofectAMINE (GIBCO/BRL Cat. # 18324-012)/well according to the protocol for
GIBCO/BRL LipofectAMINE. The transfectant was removed 5 hours later and fresh
growth medium was added to allow the cells to recover overnight. The medium was
removed and each well was gently washed twice with DMEM without methionine and
cysteine (ICN Cat. # 16-424-54). 1 ml DMEM without methionine and cysteine with 50
nCi Trans-35S (ICN Cat. # 51006) was added to each well and the cells were incubated at
37°C, 5% CO2 for the appropriate time period. A 150 µl aliquot of conditioned medium
was obtained and 150 µl of 2X SDS sample buffer was added to the aliquot. The sample
was heat-inactivated and loaded on a 4-20% SDS-PAGE gel. The gel was fixed and the
presence of secreted protein was detected by autoradiography.

TANGO 253 exhibits homology to an adipocyte complement-mediated protein
precursor and so may be involved in adipocyte function, e.g., may act as a signaling
molecule for adipocyte tissue. Figure 6A shows an alignment of the human TANGO 253
amino acid sequence (SEQ ID NO:3) with the human adipocyte complement-mediated
protein precursor amino acid sequence (SEQ ID NO:20). The alignment shows that there
is a 38.7% overall amino acid sequence identity between human TANGO 253 and human
adipocyte complement-mediated protein precursor.

Figures 7A-7C show an alignment of the nucleotide sequence of human adipocyte
complement-mediated protein precursor (SEQ ID NO:32; GenBank
Accession Number A1417523) and the nucleotide sequence of human TANGO 253 (SEQ
ID NO:1). The alignment shows a 29.1% overall sequence identity between the two
nucleotide sequences.

The human TANGO 253 nucleotide sequence was mapped to human chromosome
11, between flanking markers D11S1356 and D11S924, using the Genebridge 4 Human
Radiation hybrid mapping panel with CAAAGTGAGCTCATGCTCTCAC (SEQ ID
NO:193) as the forward primer and CTCTGGTCTTGGGCAGAAATC (SEQ ID NO:194)
as the reverse primer.

Clone EpT253, which encodes human TANGO 253, was deposited with the
American Type Culture Collection (10801 University Boulevard, Manassas, VA 20110-2209)
on April 21, 1999 and assigned Accession Number 207222. This deposit will be
maintained under the terms of the Budapest Treaty on the International Recognition of the
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made
merely as a convenience for those of skill in the art and is not an admission that a deposit
is required under 35 U.S.C. §112.

Mouse TANGO 253 

A cDNA encoding mouse TANGO 253 was identified by analyzing the sequences
of clones present in a mouse microglia library using a rat TANGO 253 probe from sciatic
nerve. This analysis led to the identification of a clone, AtmXalel075, encoding
full-length mouse TANGO 253. The mouse TANGO 253 cDNA of this clone is 1263
nucleotides long (Figures 3A-3B; SEQ ID NO:8). The open reading frame of this cDNA,
nucleotides 135 to 863 of SEQ ID NO:8 (SEQ ID NO:9), encodes a 243 amino acid
secreted protein (Figures 3A-3B; SEQ ID NO:10).

Figure 4 depicts a hydropathy plot of mouse TANGO 253. Relatively hydrophobic 
regions of the protein are shown above the horizontal line, and relatively hydrophilic 
regions of the protein are below the horizontal line. The cysteine residues (cys) are 
indicated by short vertical lines just below the hydropathy trace. The dashed vertical line
separates the signal sequence (amino acid 1 to amino acid 15 of SEQ ID NO:10; SEQ ID
NO:12) on the left from the mature protein (amino acid 16 to amino acid 243 of SEQ ID
NO:10; SEQ ID NO:11) on the right.

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 
Engineering 10:1-6) predicted that mouse TANGO 253 includes a 15 amino acid signal
peptide (amino acid 1 to amino acid 15 of SEQ ID NO:10; SEQ ID NO:12) preceding the
mature mouse TANGO 253 protein (corresponding to amino acid 16 to amino acid 243 of
SEQ ID NO:10; SEQ ID NO:11). The molecular weight of mouse TANGO 253 protein
without post-translational modifications is 25.4 kDa prior to the cleavage of the signal 
peptide, 23.9 kDa after cleavage of the signal peptide. 

Mouse TANGO 253 includes a collagen domain (at amino acids 36 to 95 of SEQ
ID NO:10; SEQ ID NO:13) and a C1q domain (at amino acids 105 to 232 of SEQ ID NO:10;
SEQ ID NO:14).

Three protein kinase C phosphorylation sites are present in mouse TANGO 253.
The first has the sequence SAK (at amino acids 107 to 109 of SEQ ID NO:10), the second
has the sequence TGK (at amino acids 140 to 142 of SEQ ID NO:10), and the third has the
sequence SIK (at amino acids 220 to 222 of SEQ ID NO:10). Mouse TANGO 253 has
four N-myristylation sites. The first has the sequence GLVSGS (at amino acids 11 to 16
of SEQ ID NO:10), the second has the sequence GGRPGL (at amino acids 68 to 73 of
SEQ ID NO:10), the third has the sequence GQSIAS (at amino acids 172 to 177 of SEQ
ID NO:10), and the fourth has the sequence GIYASI (at amino acids 216 to 221 of SEQ
ID NO:10).

As shown in Figure 5, human TANGO 253 protein and mouse TANGO 253 
protein are 93.8% identical. Figure 6B shows an alignment of the mouse TANGO 253 
amino acid sequence (SEQ ID NO: 10) with the human adipocyte complement-mediated 
protein precursor amino acid sequence (SEQ ID NO:20). The alignment shows that there
is a 38.3% overall amino acid sequence identity between mouse TANGO 253 and human
adipocyte complement-mediated protein precursor. 

Figures 8A-8C show an alignment of the nucleotide sequence of human adipocyte
complement-mediated protein precursor (SEQ ID NO:32; GenBank
Accession Number A1417523) and the nucleotide sequence of mouse TANGO 253 (SEQ
ID NO:8). The alignment shows a 30.4% overall sequence identity between the two
nucleotide sequences.

In situ tissue screening was performed on mouse embryonic tissue (obtained from
embryos at embryonic day 13.5 to postnatal day 1.5) and adult tissue to determine the
expression of mouse TANGO 253 mRNA. During embryogenesis, mouse TANGO 253 was
ubiquitously expressed throughout the central nervous system. Strong
expression of mouse TANGO 253 was detected in the choroid plexus of the fourth ventricle
of E18.5 and El. 5 embryos examined. Expression of mouse TANGO 253 was also
detected in the lungs of E14.5 and E15.5 embryos and in the kidneys of E15.5 embryos.

Mouse TANGO 253 expression was detected by in situ hybridization in the
following adult tissues: a signal was detected in the brain in the choroid plexus of the
lateral and 4th ventricles, and the olfactory bulb; a signal was detected in the cortical
region of the kidney consistent with the pattern of glomeruli (in particular, the cortical
radial veins); a ubiquitous signal was detected in the thymus; a weak, ubiquitous signal
was detected in the spleen; a moderate signal was associated with the seminiferous
vesicles of the testes; a signal was detected in the ovaries; and a ubiquitous signal
restricted to the zone of giant cells was detected in the placenta.

Clone EpTm253, which encodes mouse TANGO 253, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 20110-2209)
on April 21, 1999 and assigned Accession Number 207215. This deposit will be
maintained under the terms of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

Uses of TANGO 253 Nucleic Acids, Polypeptides, and Modulators Thereof

As TANGO 253 was originally found in the coronary artery smooth muscle library 
described above, TANGO 253 nucleic acids, proteins, and modulators thereof can be used 
to modulate the proliferation, development, differentiation, and/or function of organs, e.g., 
tissues and cells that form blood vessels and coronary tissue, e.g., cells of the coronary
connective tissue, e.g., abnormal coronary smooth muscle cells and/or endothelial cells of
blood vessels. TANGO 253 nucleic acids, proteins, and modulators thereof can also be
used to modulate symptoms associated with abnormal coronary function, e.g., heart 
diseases and disorders such as atherosclerosis, coronary artery disease and plaque 
formation. 

In light of the collagen domain, TANGO 253 nucleic acids, proteins and
modulators thereof can be utilized to modulate (e.g., stabilize, promote, inhibit or disrupt)
cell/extracellular matrix (ECM) interactions, cell/cell interactions and, for example, signal
transduction events associated with such interactions. For example, such TANGO 253
compositions and modulators thereof can be used to modulate binding of such
ECM-associated factors as integrin and can function to modulate ligand binding to cell surface
receptors. In addition, TANGO 253 nucleic acids, proteins and modulators thereof can be
utilized to modulate connective tissue formation, maintenance and function, as well as to
modulate symptoms associated with connective tissue-related disorders, to promote wound
healing, and to reduce, slow, inhibit or ameliorate connective tissue-related signs of aging,
such as wrinkle formation.

In light of the C1q domain exhibited by TANGO 253 proteins and their similarity
to the collectin family, TANGO 253 nucleic acids, proteins and modulators thereof can be
utilized to modulate immune-related processes such as the ability to modulate host
immune response by, e.g., modulating one or more elements in the serum complement
cascade, including, for example, activation of the cascade, formation of and/or binding to
immune complexes, detection and defense against surface antigens and bacteria, and
immune surveillance for rapid removal of pathogens. Such TANGO 253 compositions
and modulators thereof can be utilized, e.g., to ameliorate incidence of any symptoms
associated with disorders that involve such immune-related processes, including, but not
limited to, infection and autoimmune disorders.

In addition, such compositions and modulators thereof can be utilized to modulate 
folding and alignment of the collagen domain (e.g., into a triple helix), disorders 
associated with collagen defects, including but not limited to bone disorders, e.g., bone 
resorption disorders, or hearing, e.g., inner ear, disorders, to modulate protein-protein
interactions and recognition events (either homotypic or heterotypic) and cellular response
events (e.g., signal transduction events) associated with such interactions and recognitions,
and to ameliorate symptoms associated with abnormal signaling, protein-protein
interaction and/or cellular response events including, but not limited to cell proliferation
disorders such as cancer, abnormal neuronal interactions, such as disorders involving
abnormal synaptic activity, e.g., abnormal Purkinje cell activities.

Human TANGO 253 protein contains an RGD domain. As such, TANGO 253
nucleic acids, proteins and modulators thereof can be utilized to modulate processes
involved in, e.g., bone development, sepsis, tumor progression, metastasis, cell migration,
fertilization, and cellular interactions with the extracellular matrix required for growth,
differentiation, and apoptosis, as well as cellular processes involving cell adhesion, such as
cell migration.

TANGO 253 proteins exhibit similarity to adipocyte complement-related protein 
precursor and can act as signaling molecules for adipocyte tissue. In light of this, TANGO 
253 nucleic acids, proteins and modulators thereof can be utilized to modulate adipocyte 
function and adipocyte-related processes and disorders such as, e.g., obesity.

TANGO 253 nucleic acids, proteins, and modulators thereof can also be utilized to 
modulate the development, differentiation, maturation, proliferation and/or activity of cells 
of the central nervous system such as neurons, glial cells (e.g., astrocytes and 
oligodendrocytes), and Schwann cells. TANGO 253 nucleic acids, polypeptides, or
modulators thereof can also be used to treat disorders of the brain, such as cerebral edema,
hydrocephalus, brain herniations, iatrogenic disease (due to, e.g., infection, toxins, or
drugs), inflammations (e.g., bacterial and viral meningitis, encephalitis, and cerebral
toxoplasmosis), cerebrovascular diseases (e.g., hypoxia, ischemia, and infarction,
intracranial hemorrhage and vascular malformations, and hypertensive encephalopathy),
tumors (e.g., neuroglial tumors, neuronal tumors, tumors of pineal cells, meningeal
tumors, primary and secondary lymphomas, intracranial tumors, and medulloblastoma),
and to treat injury or trauma to the brain.

TANGO 253 nucleic acids, proteins, and modulators thereof can also be utilized to 
treat renal (kidney) disorders, such as glomerular diseases (e.g., acute and chronic
glomerulonephritis, rapidly progressive glomerulonephritis, nephrotic syndrome, focal
proliferative glomerulonephritis, glomerular lesions associated with systemic disease, such
as systemic lupus erythematosus, Goodpasture's syndrome, multiple myeloma, diabetes,
polycystic kidney disease, neoplasia, sickle cell disease, and chronic inflammatory
diseases), tubular diseases (e.g., acute tubular necrosis and acute renal failure, polycystic
renal disease, medullary sponge kidney, medullary cystic disease, nephrogenic diabetes, and
renal tubular acidosis), tubulointerstitial diseases (e.g., pyelonephritis, drug and toxin
induced tubulointerstitial nephritis, hypercalcemic nephropathy, and hypokalemic
nephropathy), acute and rapidly progressive renal failure, chronic renal failure,
nephrolithiasis, gout, vascular diseases (e.g., hypertension and nephrosclerosis,
microangiopathic hemolytic anemia, atheroembolic renal disease, diffuse cortical necrosis,
and renal infarcts), or tumors (e.g., renal cell carcinoma and nephroblastoma).

TANGO 253 nucleic acids, proteins and modulators thereof can, in addition to the
above, be utilized to regulate or modulate development and/or differentiation of processes
involved in microglial, lung, liver, kidney, pancreas, brain, placental and skeletal muscle
formation and activity, as well as in ameliorating any symptom associated with a disorder
of such cell types, tissues and organs.

TANGO 253 expression can be utilized as a marker (e.g., an in situ marker) for 
specific tissues (e.g., the brain) and/or cells (e.g., neurons) in which TANGO 253 is 
expressed. TANGO 253 nucleic acids can also be utilized for chromosomal mapping. 

Human TANGO 257

A cDNA encoding human TANGO 257 was identified by analyzing the sequences 
of clones present in a coronary smooth muscle library for sequences that encode secreted 
proteins. This analysis led to the identification of a clone, Athma7cl0, encoding full- 
length human TANGO 257. The human TANGO 257 cDNA of this clone is 1832
nucleotides long (Figures 9A-9B; SEQ ID NO:15). The open reading frame of this cDNA,
nucleotides 88 to 1305 of SEQ ID NO:15 (SEQ ID NO:16), encodes a 406 amino acid
secreted protein (Figures 9A-9B; SEQ ID NO:17).

Figure 10 depicts a hydropathy plot of human TANGO 257. Relatively 
hydrophobic regions of the protein are above the horizontal line, and relatively hydrophilic
regions of the protein are below the horizontal line. The cysteine residues (cys) and
N-glycosylation sites (Ngly) are indicated by short vertical lines just below the
hydropathy trace. The dashed vertical line separates the signal sequence from the mature 
protein described below. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein
Engineering 10:1-6) predicted that human TANGO 257 includes a 21 amino acid signal
peptide (amino acid 1 to amino acid 21 of SEQ ID NO:17; SEQ ID NO:19) preceding the
mature human TANGO 257 protein (corresponding to amino acid 22 to amino acid 406 of
SEQ ID NO:17; SEQ ID NO:18). The molecular weight of human TANGO 257 protein
without post-translational modifications is 46.0 kDa prior to the cleavage of the signal
peptide, 43.8 kDa after cleavage of the signal peptide.

Two N-glycosylation sites are present in human TANGO 257. The first has the
sequence NDTA and is found at amino acids 177 to 180 of SEQ ID NO:17, and the second
has the sequence NRTV and is found at amino acids 248 to 251 of SEQ ID NO:17. A
cAMP and cGMP-dependent protein kinase phosphorylation site having the sequence
RKAS is found in human TANGO 257 at amino acids 196 to 199 of SEQ ID NO:17. Five
protein kinase C phosphorylation sites are present in human TANGO 257. The first has
the sequence SSR (at amino acids 48 to 50 of SEQ ID NO:17), the second has the
sequence SGR (at amino acids 84 to 86 of SEQ ID NO:17), the third has the sequence
SMK (at amino acids 144 to 146 of SEQ ID NO:17), the fourth has the sequence TEK (at
amino acids 166 to 168 of SEQ ID NO:17) and the fifth has the sequence SLR (at amino
acids 374 to 376 of SEQ ID NO:17). Five casein kinase II phosphorylation sites are
present in human TANGO 257. The first has the sequence TEAD (at amino acids 78 to 81
of SEQ ID NO:17), the second has the sequence TQND (at amino acids 175 to 178 of
SEQ ID NO:17), the third has the sequence TVVD (at amino acids 250 to 253 of SEQ ID
NO:17), the fourth has the sequence TYID (at amino acids 272 to 275 of SEQ ID NO:17),
and the fifth has the sequence TRED (at amino acids 289 to 292 of SEQ ID NO:17).
Human TANGO 257 has a tyrosine kinase phosphorylation site having the sequence
RLEREVDY at amino acids 89 to 96 of SEQ ID NO:17. Human TANGO 257 has three
N-myristylation sites. The first has the sequence GGPGTK (at amino acids 115 to 120 of
SEQ ID NO:17), the second has the sequence GGPAGL (at amino acids 152 to 157 of
SEQ ID NO:17) and the third has the sequence GAHASL (at amino acids 370 to 375 of
SEQ ID NO:17). Human TANGO 257 has an amidation site having the sequence KGRR
at amino acids 122 to 125 of SEQ ID NO:17.
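
Because each site is given as both a sequence and a residue range, the two can be cross-checked: the inclusive length of the range should equal the length of the recited sequence. The following is a minimal sketch of such a consistency check, using the human TANGO 257 sites listed above; the data layout and function name are hypothetical and are introduced here only for illustration.

```python
# (site sequence, first residue, last residue) as listed for human TANGO 257
annotated_sites = [
    ("NDTA", 177, 180), ("NRTV", 248, 251),          # N-glycosylation
    ("RKAS", 196, 199),                              # cAMP/cGMP-dependent kinase
    ("SSR", 48, 50), ("SGR", 84, 86), ("SMK", 144, 146),
    ("TEK", 166, 168), ("SLR", 374, 376),            # protein kinase C
    ("TEAD", 78, 81), ("TQND", 175, 178), ("TVVD", 250, 253),
    ("TYID", 272, 275), ("TRED", 289, 292),          # casein kinase II
    ("RLEREVDY", 89, 96),                            # tyrosine kinase
    ("GGPGTK", 115, 120), ("GGPAGL", 152, 157),
    ("GAHASL", 370, 375),                            # N-myristylation
    ("KGRR", 122, 125),                              # amidation
]


def inconsistent_sites(sites):
    """Return the entries whose inclusive span length differs from the sequence length."""
    return [s for s in sites if s[2] - s[1] + 1 != len(s[0])]


print(inconsistent_sites(annotated_sites))
```

A check of this kind verifies only internal consistency of the listed coordinates; confirming the positions themselves would require the sequence of SEQ ID NO:17.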

Northern analysis of human TANGO 257 expression demonstrates moderate 
expression in heart, liver and pancreas, and low expression in kidney, lung and skeletal
muscle.

Secretion assays reveal a human TANGO 257 protein of approximately 50 kDa.
The secretion assays were performed as described in the human TANGO 253 section 
above. 

The human TANGO 257 nucleotide sequence was mapped to human chromosome
1 using the Genebridge 4 Human Radiation hybrid mapping panel with
GGATGATGGCTACCAGATTGTC (SEQ ID NO:195) as the forward primer and
GGAACATTGAGGGTTTTGACTC (SEQ ID NO:196) as the reverse primer.

TANGO 257 is homologous to a protein encoded by a nucleic acid sequence
referred to in PCT Publication WO 98/39446 as "gene 64". Figure 14 shows an alignment
of the human TANGO 257 amino acid sequence (SEQ ID NO:17) with the gene 64
encoded amino acid sequence (SEQ ID NO:43). As shown in the figure, the 353 amino
acid gene 64 polypeptide is identical to amino acid residues 1-353 of human TANGO 257
(SEQ ID NO:17). Human TANGO 257 contains 406 amino acids, i.e., contains an
additional 53 amino acid residues carboxy to residue 353. The overall amino acid
sequence identity between full-length human TANGO 257 polypeptide and the gene
64-encoded polypeptide is approximately 87%.
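
The approximately 87% figure is consistent with counting the 353 identical residues against the full 406-residue length of human TANGO 257. A minimal sketch of this arithmetic follows; the use of the longer sequence as the denominator is an assumption made here for illustration, since the specification does not state how the overall identity was computed, and the function name is hypothetical.

```python
def percent_identity(identical_residues: int, reference_length: int) -> float:
    """Percent identity relative to a chosen reference length."""
    if reference_length <= 0 or not 0 <= identical_residues <= reference_length:
        raise ValueError("invalid residue counts")
    return 100.0 * identical_residues / reference_length


# 353 identical residues (gene 64 polypeptide) over 406 residues (human TANGO 257)
overall = percent_identity(353, 406)
additional_residues = 406 - 353
print(round(overall, 1), additional_residues)
```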







Figures 15A-15D show an alignment of the nucleotide sequence of gene 64 (SEQ
ID NO:66; PCT Publication WO 98/39446) and the nucleotide sequence of human
TANGO 257 (SEQ ID NO:15). The nucleotide sequences of gene 64 and human TANGO
257 are 93.5% identical. Among the differences between the sequences is a cytosine
nucleotide at human TANGO 257 (SEQ ID NO:15) position 1587 that represents an
insertion relative to the corresponding gene 64 position when the gene 64 and TANGO
257 sequences are aligned. This additional cytosine results in the TANGO 257 open
reading frame being 1218 base pairs encoding a polypeptide of 406 amino acid residues.
In contrast, the gene 64 nucleic acid sequence encodes a polypeptide of only 353 amino
acid residues, as discussed above.
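
The effect of a single-nucleotide insertion on the length of an open reading frame can be illustrated with a toy example. The following is a minimal sketch; the sequences and the reduced codon table are invented for illustration and are not taken from SEQ ID NO:15 or SEQ ID NO:66.

```python
# Minimal codon table covering only the codons used in this toy example.
CODONS = {"ATG": "M", "AAA": "K", "CTG": "L", "ACC": "T", "CGG": "R",
          "TGA": "*", "TAA": "*"}


def translate(dna: str) -> str:
    """Translate from the first base until a stop codon or the end of the sequence."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODONS[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


reference = "ATGAAATGACCCGGTAA"                       # toy sequence with an early stop codon
with_insertion = reference[:6] + "C" + reference[6:]  # one additional cytosine

print(translate(reference), translate(with_insertion))
```

In this toy case the inserted cytosine shifts the reading frame so that the early stop codon is no longer read in frame, and translation continues to a later stop codon, yielding a longer polypeptide.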

Clone EpT257, which encodes human TANGO 257, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 20110-2209)
on April 21, 1999 and assigned Accession Number 207222. This deposit will be
maintained under the terms of the Budapest Treaty on the International Recognition of the
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

Mouse TANGO 257 

A cDNA encoding mouse TANGO 257 was identified by analyzing the sequences
of clones present in a mouse microglia library using a rat TANGO 257 probe. This
analysis led to the identification of a clone, Atmual02gbl, encoding full-length mouse
TANGO 257. The mouse TANGO 257 cDNA of this clone is 1721 nucleotides long
(Figures 11A-11B; SEQ ID NO:21). The open reading frame of this cDNA, nucleotides
31 to 1248 of SEQ ID NO:21 (SEQ ID NO:22), encodes a 406 amino acid secreted protein
(Figures 11A-11B; SEQ ID NO:23).

Figure 12 depicts a hydropathy plot of mouse TANGO 257. Relatively 
hydrophobic regions of the protein are above the horizontal line, and relatively hydrophilic
regions of the protein are below the horizontal line. The cysteine residues (cys) and
N-glycosylation sites (Ngly) are indicated by short vertical lines just below the
hydropathy trace. The dashed vertical line separates the signal sequence from the mature
protein described below. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 
Engineering 10:1-6) predicted that mouse TANGO 257 includes a 21 amino acid signal
peptide (amino acid 1 to amino acid 21 of SEQ ID NO:23; SEQ ID NO:25) preceding the
mature TANGO 257 protein (corresponding to amino acid 22 to amino acid 406 of SEQ
ID NO:23; SEQ ID NO:24). The molecular weight of mouse TANGO 257 protein
without post-translational modifications is 45.8 kDa prior to the cleavage of the signal 
peptide, 43.6 kDa after cleavage of the signal peptide. 

Two N-glycosylation sites are present in mouse TANGO 257. The first has the
sequence NDTA and is found at amino acids 177 to 180 of SEQ ID NO:23, and the second
has the sequence NRTV and is found at amino acids 248 to 251 of SEQ ID NO:23. A
cAMP and cGMP-dependent protein kinase phosphorylation site having the sequence
RKAS is found in mouse TANGO 257 at amino acids 196 to 199 of SEQ ID NO:23. Five
protein kinase C phosphorylation sites are present in mouse TANGO 257. The first has
the sequence SSR (at amino acids 48 to 50 of SEQ ID NO:23), the second has the
sequence TLR (at amino acids 75 to 77 of SEQ ID NO:23), the third has the sequence
SGR (at amino acids 84 to 86 of SEQ ID NO:23), the fourth has the sequence SMK (at
amino acids 144 to 146 of SEQ ID NO:23) and the fifth has the sequence SLR (at amino
acids 374 to 376 of SEQ ID NO:23). Five casein kinase II phosphorylation sites are
present in mouse TANGO 257. The first has the sequence TEAD (at amino acids 78 to 81
of SEQ ID NO:23), the second has the sequence TQND (at amino acids 175 to 178 of
SEQ ID NO:23), the third has the sequence TVVD (at amino acids 250 to 253 of SEQ ID
NO:23), the fourth has the sequence TYID (at amino acids 272 to 275 of SEQ ID NO:23),
and the fifth has the sequence TRRD (at amino acids 289 to 292 of SEQ ID NO:23).
Mouse TANGO 257 has a tyrosine kinase phosphorylation site having the sequence
RLEREVDY at amino acids 89 to 96 of SEQ ID NO:23. Mouse TANGO 257 has four
N-myristylation sites. The first has the sequence GGPGAK (at amino acids 115 to 120 of
SEQ ID NO:23), the second has the sequence GGSVGL (at amino acids 151 to 157 of
SEQ ID NO:23), the third has the sequence GGPGGG (at amino acids 227 to 232 of SEQ
ID NO:23), and the fourth has the sequence GAHASL (at amino acids 370 to 375 of SEQ
ID NO:23). Mouse TANGO 257 has an amidation site having the sequence KGRR at
amino acids 122 to 125 of SEQ ID NO:23.

As shown in Figure 13, human TANGO 257 protein and mouse TANGO 257 
protein are 94.1% identical. 

Figure 16 shows an alignment of mouse TANGO 257 amino acid sequence (SEQ
ID NO:23) with the amino acid sequence encoded by gene 64 (SEQ ID NO:43). As
shown in the figure, the 353 amino acid gene 64 polypeptide and the 406 amino acid
mouse TANGO 257 polypeptide are approximately 82% identical. Figures 17A-17C
show an alignment of the nucleotide sequence of gene 64 (SEQ ID NO:66; PCT
Publication No. WO 98/39446) and the nucleotide sequence of mouse TANGO 257
(SEQ ID NO:21). As shown in the figure, the two nucleotide sequences are
approximately 76% identical.

In situ tissue screening was performed on mouse adult tissues and embryonic
tissues (obtained from embryos E13.5 to P1.5) to analyze for the expression of mouse
TANGO 257 mRNA. Mouse TANGO 257 expression was detected in the following adult
tissues: the submandibular gland; the renal papilla region of the kidney; the capsule region
of the adrenal gland; and the labyrinth zone of the placenta.

In the case of embryonic expression, mouse TANGO 257 expression was detected
in the bones, lungs, intestines, and kidneys. At E13.5, a signal was detected in many
tissues including the developing bone structures such as the vertebrae of the spinal
column, jaw, and scapula. At E14.5, the signal pattern was very similar to that detected at
E13.5. At E15.5, a signal was detected in all major bone structures, including the skull,
basisphenoid bone, upper and lower incisor teeth, vertebral column, sternum, scapula, and
femur. A ubiquitous signal was also detected in the lung, kidney, and intestinal tract. At
E16.5 and E18.5, the signal was very similar to that detected at E15.5. At P1.5, a signal was
still detected in all of the major bone structures, and the signal detected in the lung, kidney,
and intestines had dropped to nearly background levels.

Clone EpTm257, which encodes mouse TANGO 257, was deposited with the
American Type Culture Collection (10801 University Boulevard, Manassas, VA 20110-2209)
on April 21, 1999 and assigned Accession Number 207117. This deposit will be
maintained under the terms of the Budapest Treaty on the International Recognition of the
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made
merely as a convenience for those of skill in the art and is not an admission that a deposit
is required under 35 U.S.C. §112.

Uses of TANGO 257 Nucleic Acids, Polypeptides, and Modulators Thereof

As TANGO 257 was originally found in a coronary artery smooth muscle library, 
TANGO 257 nucleic acids, proteins, and modulators thereof can be used to modulate the 
proliferation, development, differentiation, and/or function of organs, e.g., heart, tissues
and cells that form blood vessels and coronary tissue, e.g., cells of the coronary
connective tissue, e.g., coronary smooth muscle cells and/or endothelial cells of blood
vessels. TANGO 257 nucleic acids, proteins, and modulators thereof can also be used to 
modulate symptoms associated with abnormal coronary function, e.g., heart diseases and 
disorders such as atherosclerosis, coronary artery disease and plaque formation. 

In light of TANGO 257's homology to the extracellular molecule olfactomedin,
TANGO 257 nucleic acids, proteins and modulators thereof can be utilized to modulate
development, differentiation, proliferation and/or activity of neuronal cells, e.g., olfactory
neurons, and to modulate neuronal activities involving maintenance, growth and/or
differentiation of chemosensory cilia, modulate cell-cell interactions and cell-ECM
interactions, e.g., neuronal (such as olfactory) cell-ECM interactions. TANGO 257
nucleic acids, proteins and modulators thereof can also be used to modulate symptoms
associated with abnormal processes involving such cells and/or activities, for example
neuronal function, e.g., neurological disorders, neurodegenerative disorders,
neuromuscular disorders, cognitive disorders, personality disorders, and motor disorders,
and chemosensory disorders, such as olfactory-related disorders.

10 TANGO 257 exhibits homology to a gene referred to as "gene 64" (PCT 

Publication No. WO 98/39446), which is expressed primarily in fetal lung tissue. In light 
of this, TANGO 257 nucleic acids, proteins and modulators thereof can also be used to 
modulate development, differentiation, proliferation and/or activity of pulmonary system 
cells, e.g., lung cell types, and to modulate a symptom associated with disorders of
pulmonary development, differentiation and/or activity, e.g., cystic fibrosis. TANGO 257
nucleic acids, proteins and modulators thereof can also be used to modulate symptoms 
associated with abnormal pulmonary development or function, such as lung diseases or 
disorders associated with abnormal pulmonary development or function, e.g., cystic 
fibrosis. TANGO 257 nucleic acids, polypeptides, or modulators thereof can be used to
treat pulmonary (lung) disorders, such as atelectasis, cystic fibrosis, rheumatoid lung
disease, pulmonary congestion or edema, chronic obstructive airway disease (e.g.,
emphysema, chronic bronchitis, bronchial asthma, and bronchiectasis), diffuse interstitial
diseases (e.g., sarcoidosis, pneumoconiosis, hypersensitivity pneumonitis, bronchiolitis,
Goodpasture's syndrome, idiopathic pulmonary fibrosis, idiopathic pulmonary
hemosiderosis, pulmonary alveolar proteinosis, desquamative interstitial pneumonitis,
chronic interstitial pneumonia, fibrosing alveolitis, Hamman-Rich syndrome, pulmonary
eosinophilia, diffuse interstitial fibrosis, Wegener's granulomatosis, lymphomatoid
granulomatosis, and lipid pneumonia), or tumors (e.g., bronchogenic carcinoma,
bronchioloalveolar carcinoma, bronchial carcinoid, hamartoma, and mesenchymal
tumors).

TANGO 257 nucleic acids, proteins and modulators thereof can also be used to 
modulate cell proliferation, e.g., abnormal cell proliferation. Such modulation may, for 
example, be via modulation of one or more elements involved in signal transduction 
cascades. 

TANGO 257 nucleic acids, proteins and modulators thereof can also be utilized to
modulate the development, differentiation, maturation, proliferation and/or activity of
bone cells such as osteocytes, and to treat bone associated diseases or disorders. Examples
of bone diseases and disorders include bone injury due to, for example, trauma (e.g., bone
breakage, cartilage tearing), degeneration (e.g., osteoporosis), degeneration of joints, e.g., 
arthritis, e.g., osteoarthritis, and bone wearing. Further, TANGO 257 nucleic acids,
proteins and modulators thereof can be utilized to modulate or regulate the development of
bone structures such as the skull, the basisphenoid bone, the upper and lower incisor teeth, 
the vertebral column, the sternum, the scapula, and the femur during embryogenesis. 

TANGO 257 nucleic acids, proteins and modulators thereof can, in addition to the 
above, be utilized to regulate or modulate development and/or differentiation of processes
involved in microglial, liver, kidney, and skeletal muscle formation and activity, as well
as in ameliorating a symptom associated with a disorder of such cell types, tissues and 
organs. 

TANGO 257 nucleic acids, polypeptides, or modulators thereof can also be used to
treat renal (kidney) disorders, such as glomerular diseases (e.g., acute and chronic
glomerulonephritis, rapidly progressive glomerulonephritis, nephrotic syndrome, focal
proliferative glomerulonephritis, glomerular lesions associated with systemic disease, such
as systemic lupus erythematosus, Goodpasture's syndrome, multiple myeloma, diabetes,
polycystic kidney disease, neoplasia, sickle cell disease, and chronic inflammatory
diseases), tubular diseases (e.g., acute tubular necrosis and acute renal failure, polycystic
renal disease, medullary sponge kidney, medullary cystic disease, nephrogenic diabetes, and
renal tubular acidosis), tubulointerstitial diseases (e.g., pyelonephritis, drug and toxin
induced tubulointerstitial nephritis, hypercalcemic nephropathy, and hypokalemic
nephropathy), acute and rapidly progressive renal failure, chronic renal failure,
nephrolithiasis, gout, vascular diseases (e.g., hypertension and nephrosclerosis,
microangiopathic hemolytic anemia, atheroembolic renal disease, diffuse cortical necrosis,
and renal infarcts), or tumors (e.g., renal cell carcinoma and nephroblastoma).

TANGO 257 polypeptides, nucleic acids, or modulators thereof can be used to treat
intestinal disorders, such as ischemic bowel disease, infective enterocolitis, Crohn's 
disease, benign tumors, malignant tumors (e.g., argentaffinomas, lymphomas,
adenocarcinomas, and sarcomas), malabsorption syndromes (e.g., celiac disease, tropical
sprue, Whipple's disease, and abetalipoproteinemia), obstructive lesions, hernias, 
intestinal adhesions, intussusception, or volvulus. 

Further, TANGO 257 expression can be utilized as a marker (e.g., an in situ
marker) for specific tissues (i.e., bone structures) and/or cells (i.e., osteocytes) in which
TANGO 257 is expressed. TANGO 257 nucleic acids can also be used for chromosomal
mapping.

Human INTERCEPT 258

A cDNA encoding human INTERCEPT 258 was identified by analyzing the 
sequences of clones present in a human mixed lymphocyte reaction library for sequences 
that encode secreted proteins. This analysis led to the identification of a clone, Athlxtce,
encoding full-length human INTERCEPT 258. The human INTERCEPT 258 cDNA of
this clone is 1869 nucleotides long (Figures 18A-18B; SEQ ID NO:26). The open reading 
frame of this cDNA, nucleotides 153 to 1262 of SEQ ID NO:26 (SEQ ID NO:27), encodes 
a 370 amino acid transmembrane protein (Figures 18A-18B; SEQ ID NO:28). 
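
As an illustrative cross-check of the open reading frame coordinates quoted in this specification, the short Python sketch below converts 1-based, inclusive nucleotide coordinates into a codon count and compares it with the stated protein length. The function name `orf_codons` is hypothetical, and the sketch assumes that the quoted coordinates span only the coding codons; the text does not say whether a stop codon is included.

```python
def orf_codons(start: int, end: int) -> int:
    """Return the number of codons in an ORF given 1-based inclusive coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"ORF length {length} is not a multiple of 3")
    return length // 3


# (ORF start, ORF end, stated amino acid count), all taken from the specification.
ORFS = {
    "human INTERCEPT 258": (153, 1262, 370),
    "mouse INTERCEPT 258": (107, 1288, 394),
    "human TANGO 281": (65, 799, 245),
    "mouse TANGO 281": (90, 728, 213),
}

for name, (start, end, stated) in ORFS.items():
    codons = orf_codons(start, end)
    status = "consistent" if codons == stated else "MISMATCH"
    print(f"{name}: {codons} codons vs {stated} stated residues ({status})")
```

Under that assumption, each quoted coordinate pair yields exactly the stated number of residues.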

Figure 19 depicts a hydropathy plot of human INTERCEPT 258. Relatively
hydrophobic regions of the protein are shown above the horizontal line, and relatively
hydrophilic regions of the protein are below the horizontal line. The cysteine residues
(cys) are indicated by short vertical lines just below the hydropathy trace. The dashed
vertical line separates the signal sequence (amino acids 1 to 29 of SEQ ID NO:28; SEQ ID
NO:30) on the left from the mature protein (amino acids 30 to 370 of SEQ ID NO:28;
SEQ ID NO:29) on the right.

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein
Engineering 10:1-6) predicted that human INTERCEPT 258 includes a 29 amino acid
signal peptide (amino acid 1 to amino acid 29 of SEQ ID NO:28; SEQ ID NO:30)
preceding the mature INTERCEPT 258 protein (corresponding to amino acid 30 to amino
acid 370 of SEQ ID NO:28; SEQ ID NO:29). The molecular weight of human
INTERCEPT 258 protein without post-translational modifications is 40.0 kDa prior to the
cleavage of the signal peptide, and 37.0 kDa after cleavage of the signal peptide.

Human INTERCEPT 258 contains a hydrophobic transmembrane domain at amino
acids 207 to 224 of SEQ ID NO:28 (SEQ ID NO:78) and at amino acids 247 to
271 of SEQ ID NO:28 (SEQ ID NO:33). Human INTERCEPT 258 also contains two Ig
domains, one at amino acids 49 to 128 of SEQ ID NO:28 (SEQ ID NO:35) and a second at
amino acids 167 to 226 of SEQ ID NO:28 (SEQ ID NO:36).

Five N-glycosylation sites are present in human INTERCEPT 258. The first has the
sequence NLSL and is found at amino acids 108 to 111 of SEQ ID NO:28, the second has
the sequence NVTL and is found at amino acids 169 to 172 of SEQ ID NO:28, the third
has the sequence NLSS and is found at amino acids 213 to 216 of SEQ ID NO:28, the
fourth has the sequence NVTL and is found at amino acids 236 to 239 of SEQ ID NO:28,
and the fifth has the sequence NGTL and is found at amino acids 307 to 310 of SEQ ID
NO:28. Seven protein kinase C phosphorylation sites are present in human INTERCEPT
258. The first has the sequence TSK and is found at amino acids 93 to 95 of SEQ ID
NO:28, the second has the sequence SLR and is found at amino acids 110 to 112 of SEQ
ID NO:28, the third has the sequence SIK and is found at amino acids 141 to 143 of SEQ
ID NO:28, the fourth has the sequence SCR and is found at amino acids 157 to 159, the
fifth has the sequence SPR and is found at amino acids 176 to 178 of SEQ ID NO:28, the
sixth has the sequence SAR and is found at amino acids 315 to 317 of SEQ ID NO:28, and
the seventh has the sequence SPR and is found at amino acids 344 to 346 of SEQ ID
NO:28. The human INTERCEPT 258 protein has seven N-myristoylation sites. The first
has the sequence GVTTSK and is found at amino acids 90 to 95 of SEQ ID NO:28, the
second has the sequence GANVTL and is found at amino acids 167 to 172 of SEQ ID
NO:28, the third has the sequence GVYVCK and is found at amino acids 220 to 225, the
fourth has the sequence GTAQCN and is found at amino acids 231 to 236 of SEQ ID
NO:28, the fifth has the sequence GTLVGL and is found at amino acids 256 to 261, the
sixth has the sequence GLLAGL and is found at amino acids 262 to 267 of SEQ ID
NO:28, and the seventh has the sequence GTLSSV and is found at amino acids 308 to 313 of
SEQ ID NO:28.
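
Site lists of the kind given above are typically produced by scanning a protein sequence for short consensus patterns. The following Python sketch shows one way such a scan could be written. The specification does not state which tool or patterns were used, so the regular expressions here are an assumption: simplified forms of widely used PROSITE-style consensus patterns. The function name `scan_motifs` and the toy sequence (assembled from site sequences quoted in this specification, not from SEQ ID NO:28 itself) are hypothetical.

```python
import re

# Assumed consensus patterns (not stated in the specification).
MOTIFS = {
    "N-glycosylation": r"N[^P][ST][^P]",
    "PKC phosphorylation": r"[ST].[RK]",
    "CK2 phosphorylation": r"[ST]..[DE]",
    "N-myristoylation": r"G[^EDRKHPFYW]..[STAGCN][^P]",
}


def scan_motifs(seq: str) -> dict:
    """Return {motif name: [(start, end, matched text), ...]} with 1-based inclusive positions."""
    hits = {}
    for name, pattern in MOTIFS.items():
        # A lookahead with a capture group reports overlapping matches as well.
        regex = re.compile("(?=(" + pattern + "))")
        hits[name] = [
            (m.start() + 1, m.start() + len(m.group(1)), m.group(1))
            for m in regex.finditer(seq)
        ]
    return hits


# Hypothetical toy sequence built from quoted site sequences (GANVTL, SPR, TLEE).
toy = "GANVTLSPRTLEE"
for motif, found in scan_motifs(toy).items():
    print(motif, found)
```

Because such short patterns occur frequently by chance, hits of this kind indicate potential sites only.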

The human INTERCEPT 258 gene was mapped to human chromosome 11 using the
Genebridge 4 Human Radiation hybrid mapping panel with
GGAGTATCCTTGGTCTACTCC (SEQ ID NO:197) as the forward primer and
GAAAGTCTGGAAGGATGGAAGCT (SEQ ID NO:198) as the reverse primer.

Human multi-tissue dot blot analysis of human INTERCEPT 258 expression
demonstrates strongest expression in lung, fetal lung, placenta, thyroid gland and
mammary gland. Moderate expression is observed in heart, aorta, kidney, small intestine,
fetal heart, fetal kidney, fetal spleen, uterus, and stomach. Weak expression is observed in
whole brain, amygdala, caudate nucleus, cerebellum, cerebral cortex, frontal lobe,
hippocampus, medulla oblongata, occipital lobe, putamen, substantia nigra, temporal lobe,
thalamus, accumbens, spinal cord, skeletal muscle, colon, bladder, prostate, ovary, pancreas,
pituitary gland, adrenal gland, salivary gland, liver, spleen, thymus, lymph node, bone
marrow, appendix, trachea, fetal brain, fetal liver, and fetal thymus.

A human cancer cell line Northern blot analysis showed a roughly 2.0 kb
INTERCEPT 258 band only in the lane containing cell line Chronic Myelogenous
Leukemia (K-562). The cancerous cell lines in which INTERCEPT 258 was not
expressed include promyelocytic leukemia, HeLa, lymphoblastic leukemia, Burkitt's
lymphoma Raji, colorectal adenocarcinoma, lung carcinoma and melanoma.

INTERCEPT 258 exhibits homology to a human A33 antigen. A33 antigen is a
transmembrane glycoprotein and a member of the immunoglobulin superfamily that may
represent a cancer cell marker (Heath et al., 1997, Proc. Natl. Acad. Sci. USA 94:469-474).
Figure 23 shows an alignment of the human INTERCEPT 258 amino acid sequence
(SEQ ID NO:28) with the human A33 amino acid sequence (SEQ ID NO:67). The
alignment shows that there is a 23.0% overall amino acid sequence identity between
human INTERCEPT 258 and A33. Figures 24A-24D show an alignment of the human
INTERCEPT 258 nucleotide sequence (SEQ ID NO:26) with that of human A33
nucleotide sequence (SEQ ID NO:68). The alignment shows that there is a 40.6% identity
between the two sequences.
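
Percent identity figures such as those above are derived from a pairwise alignment. The specification does not state the alignment program or how the denominator was chosen, so the Python sketch below is only one common convention, offered as an assumption: identical, non-gap columns divided by the number of alignment columns in which at least one sequence has a residue. The function name `percent_identity` and the toy aligned strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns, for two equal-length gapped strings."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Count columns where both sequences carry the same residue (gaps never match).
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    # Ignore columns where both sequences have a gap.
    columns = sum(1 for x, y in zip(aligned_a, aligned_b) if not (x == gap and y == gap))
    if columns == 0:
        raise ValueError("alignment has no residue columns")
    return 100.0 * matches / columns


# Hypothetical toy alignment: 4 identical columns out of 8.
print(percent_identity("GVYVCK-A", "GTYRCSLA"))
```

Other conventions (for example, dividing by the length of the shorter sequence) give different values for the same alignment, which is why the method should be stated alongside any identity figure.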

Human INTERCEPT 258 nucleotide sequence (SEQ ID NO:26) exhibits 
homology to human PECAM-1 nucleotide sequence (SEQ ID NO:72). Figures 27A-27E 
show that there is an overall 40.5% identity between the two nucleotide sequences. 

Human INTERCEPT 258 amino acid sequence (SEQ ID NO:28) and human PECAM-1
amino acid sequence (SEQ ID NO:73) share less than 18% identity. PECAM-1 (platelet
endothelial cell adhesion molecule-1) is an integrin expressed on endothelial cells.

Clone EpT258, which encodes human INTERCEPT 258, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 20110-2209)
on April 21, 1999 and assigned Accession Number 207222. This deposit will be
maintained under the terms of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

Mouse INTERCEPT 258 

A cDNA encoding mouse INTERCEPT 258 was identified by analyzing the 
sequences of clones present in a mouse megakaryocyte library for sequences that encode 
secreted proteins. This analysis led to the identification of a clone, Athmeal7c8, encoding
full-length mouse INTERCEPT 258. The mouse INTERCEPT 258 cDNA of this clone is
1846 nucleotides long (Figures 20A-20B; SEQ ID NO:37). The open reading frame of
this cDNA, nucleotides 107 to 1288 of SEQ ID NO:37 (SEQ ID NO:38), encodes a 394 
amino acid transmembrane protein (Figures 20A-20B, SEQ ID NO:39). 

Figure 21 depicts a hydropathy plot for mouse INTERCEPT 258. Relatively
hydrophobic regions of the protein are above the horizontal line, and relatively hydrophilic
regions of the protein are below the horizontal line. The cysteine residues (cys) and
N-glycosylation sites (Ngly) are indicated by short vertical lines just below the
hydropathy trace. The dashed vertical line separates the signal sequence from the mature
protein described below.

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein
Engineering 10:1-6) predicted that mouse INTERCEPT 258 includes a 29 amino acid
signal peptide (amino acid 1 to amino acid 29 of SEQ ID NO:39; SEQ ID NO:41)
preceding the mature INTERCEPT 258 protein (corresponding to amino acid 30 to amino
acid 394 of SEQ ID NO:39; SEQ ID NO:40). The molecular weight of mouse INTERCEPT 258
without post-translational modifications is 41.8 kDa prior to the cleavage of the signal
peptide, and 38.90 kDa after cleavage of the signal peptide.

Mouse INTERCEPT 258 contains a hydrophobic transmembrane domain at amino
acids 250 to 274 of SEQ ID NO:39 (SEQ ID NO:44). Mouse INTERCEPT 258 also contains
an Ig domain at amino acids 170 to 229 of SEQ ID NO:39 (SEQ ID NO:71). 

Five N-glycosylation sites are present in mouse INTERCEPT 258. The first has the
sequence NVSL and is found at amino acids 111 to 114 of SEQ ID NO:39, the second has
the sequence NVTL and is found at amino acids 172 to 175 of SEQ ID NO:39, the third
has the sequence NLSI and is found at amino acids 216 to 219 of SEQ ID NO:39, the
fourth has the sequence NVTL and is found at amino acids 239 to 242 of SEQ ID NO:39,
and the fifth has the sequence NGTL and is found at amino acids 310 to 313 of SEQ ID
NO:39. Nine protein kinase C phosphorylation sites are present in mouse INTERCEPT
258. The first has the sequence TNK and is found at amino acids 96 to 98 of SEQ ID
NO:39, the second has the sequence SSR and is found at amino acids 108 to 110 of SEQ
ID NO:39, the third has the sequence SLR and is found at amino acids 113 to 115 of SEQ
ID NO:39, the fourth has the sequence TYR and is found at amino acids 126 to 128, the
fifth has the sequence SIK and is found at amino acids 144 to 146 of SEQ ID NO:39, the
sixth has the sequence SPR and is found at amino acids 179 to 181 of SEQ ID NO:39, the
seventh has the sequence SLK and is found at amino acids 211 to 213, the eighth has the
sequence SAR and is found at amino acids 318 to 320 of SEQ ID NO:39, and the ninth
has the sequence SPR and is found at amino acids 348 to 350 of SEQ ID NO:39. The
mouse INTERCEPT 258 contains a casein kinase II phosphorylation site having the
sequence TLEE, found at amino acids 280 to 283 of SEQ ID NO:39. The mouse
INTERCEPT 258 protein has nine N-myristoylation sites. The first has the sequence
GTPETS and is found at amino acids 6 to 11 of SEQ ID NO:39, the second has the
sequence GVMTNK and is found at amino acids 93 to 98 of SEQ ID NO:39, the third
has the sequence GTYRCS and is found at amino acids 125 to 130, the fourth has the
sequence GTNVTL and is found at amino acids 170 to 175 of SEQ ID NO:39, the fifth
has the sequence GVYVCK and is found at amino acids 223 to 228, the sixth has the
sequence GSKAAV and is found at amino acids 247 to 252, the seventh has the sequence
GAVVGT and is found at amino acids 255 to 260 of SEQ ID NO:39, the eighth has the
sequence GTLSSV and is found at amino acids 311 to 316 of SEQ ID NO:39, and the
ninth has the sequence GGVSSS and is found at amino acids 367 to 372 of SEQ ID
NO:39.

An in situ expression analysis of INTERCEPT 258 was performed as summarized
herein. Mouse INTERCEPT 258 expression during embryogenesis (E13.5 to P1.5 were
examined) was observed throughout the animal in a punctate pattern. This pattern is very
similar to that seen with the molecule PECAM-1, but at a lower intensity. PECAM-1 is an
integrin expressed on endothelial cells. In addition, lung and brown fat exhibited a much
higher signal in a more ubiquitous pattern in all embryonic stages examined. Heart and
kidney also have a higher expression, but to a lesser degree. Adult mouse INTERCEPT
258 expression was seen in many tissues, often in a multifocal, punctate pattern suggestive
of vessels. Expression was also predominant in many highly vascularized tissues such as
ovary (especially the septal region), kidney and adrenal cortex.

In general, both embryonic and adult expression patterns were suggestive of
endothelial cells being a component in the expression patterns observed. In summary,
tissues in which INTERCEPT 258 expression was observed were as follows: brain, eye,
Harderian gland, submandibular gland, bladder, brown fat, stomach, heart, kidney, adrenal
gland, colon, liver, thymus, lymph node, spleen, spinal cord, ovary, testes and placenta.

As shown in Figure 22, human INTERCEPT 258 protein and mouse INTERCEPT
258 protein are 62.8% identical.

Mouse INTERCEPT 258 exhibits homology to a human A33 antigen. Figure 25
shows an alignment of mouse INTERCEPT 258 amino acid sequence (SEQ ID NO:39)
with the human A33 amino acid sequence (SEQ ID NO:96). The alignment shows that
there is a 23% overall amino acid sequence identity between the two sequences. Figures
26A-26D show an alignment of the mouse INTERCEPT 258 nucleotide sequence (SEQ
ID NO:37) with that of the human A33 nucleotide sequence (SEQ ID NO:97). The
alignment shows that there is a 40% identity between these two nucleotide sequences.

Clone EpTm258, which encodes mouse INTERCEPT 258, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 20110-2209)
on April 21, 1999 and assigned Accession Number 207221. This deposit will be
maintained under the terms of the Budapest Treaty on the International Recognition of the
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

Uses of INTERCEPT 258 Nucleic Acids, Polypeptides, and Modulators Thereof

INTERCEPT 258 was identified as being expressed in a mixed lymphocyte library.
In light of this, INTERCEPT 258 nucleic acids, proteins and modulators thereof can be 
utilized to modulate processes involved in lymphocyte development, differentiation and 
activity, including, but not limited to, development, differentiation and activation of T
cells, including T helper, T cytotoxic and non-specific T killer cell types and subtypes, and
B cells, immune functions associated with such cells, and amelioration of one or more
symptoms associated with abnormal function of such cell types. Such disorders can
include, but are not limited to, autoimmune disorders, such as organ specific autoimmune
disorders, e.g., autoimmune thyroiditis, Type I diabetes mellitus, insulin-resistant diabetes,
autoimmune anemia, multiple sclerosis, and/or systemic autoimmune disorders, e.g.,
rheumatoid arthritis, lupus or scleroderma, allergy, including allergic rhinitis and food
allergies, asthma, psoriasis, graft rejection, transplantation rejection, graft versus host 
disease, pathogenic susceptibilities, e.g., susceptibility to certain bacterial or viral 
pathogens, wound healing and inflammatory reactions. 

INTERCEPT 258 includes one or more Ig domains. INTERCEPT 258 nucleic
acids, proteins, and modulators thereof can, therefore, be used to modulate immune
function, e.g., by the modulation of immunoglobulins and the formation of antibodies.
For the same reason, INTERCEPT 258 nucleic acids, proteins, and modulators thereof can
be used to modulate immune response, leukocyte trafficking, cancer, Type I immunologic
disorders, e.g., anaphylaxis and/or rhinitis, by modulating the interaction between antigens
and cell receptors, e.g., high affinity IgE receptors.

INTERCEPT 258 exhibits homology to PECAM-1, a cell adhesion integrin 
molecule that has been shown to mediate cell-cell interactions, play an important role in 
bidirectional signal transduction, and may be involved in thrombotic, inflammatory and
immunological disorders. As such, INTERCEPT 258 nucleic acids, proteins, and
modulators thereof can be utilized to modulate cell/cell interactions and, for example, 
signal transduction events associated with such interactions. For example, such 
INTERCEPT 258 compositions and modulators thereof can be used to modulate binding 
of cellular factors or ECM-associated factors such as integrin and can function to modulate
ligand binding to cell surface receptors. Further, such INTERCEPT 258 compositions and
modulators thereof can be utilized to ameliorate at least one symptom associated with 
thrombotic disorders, e.g., stroke, inflammatory processes or disorders, and immune 
disorders. 

In light of INTERCEPT 258 expression, INTERCEPT 258 nucleic acids, proteins
and modulators thereof can be utilized to modulate development, differentiation,
proliferation and/or activity of pulmonary system cells, e.g., lung cell types, and to
modulate a symptom associated with disorders of pulmonary development, differentiation
and/or activity, such as lung diseases or disorders associated with abnormal pulmonary
development or function, e.g., cystic fibrosis. INTERCEPT 258 nucleic acids, proteins
and modulators thereof can also be utilized to modulate development, differentiation,
proliferation and/or activity of thyroid cells, megakaryocytes or mammary gland cells,
and can further be utilized to ameliorate at least one symptom of disorders associated with
abnormal thyroid function, e.g., thyroiditis or Graves' disease, abnormal megakaryocyte
differentiation or function, e.g., anemias or leukemias, hematological diseases such as
thrombocytopenia, platelet disorders and bleeding disorders, such as hemophilia, or
abnormal mammary development or function.

INTERCEPT 258 nucleic acids, polypeptides, or modulators thereof can be used to
treat renal (kidney) disorders, such as glomerular diseases (e.g., acute and chronic
glomerulonephritis, rapidly progressive glomerulonephritis, nephrotic syndrome, focal
proliferative glomerulonephritis, glomerular lesions associated with systemic disease, such
as systemic lupus erythematosus, Goodpasture's syndrome, multiple myeloma, diabetes,
polycystic kidney disease, neoplasia, sickle cell disease, and chronic inflammatory
diseases), tubular diseases (e.g., acute tubular necrosis and acute renal failure, polycystic
renal disease, medullary sponge kidney, medullary cystic disease, nephrogenic diabetes, and
renal tubular acidosis), tubulointerstitial diseases (e.g., pyelonephritis, drug and toxin
induced tubulointerstitial nephritis, hypercalcemic nephropathy, and hypokalemic
nephropathy), acute and rapidly progressive renal failure, chronic renal failure,
nephrolithiasis, gout, vascular diseases (e.g., hypertension and nephrosclerosis,
microangiopathic hemolytic anemia, atheroembolic renal disease, diffuse cortical necrosis,
and renal infarcts), or tumors (e.g., renal cell carcinoma and nephroblastoma).

INTERCEPT 258 nucleic acids, polypeptides, or modulators thereof can also be used to treat
disorders of the brain, such as cerebral edema, hydrocephalus, brain herniations, iatrogenic 
disease (due to, e.g., infection, toxins, or drugs), inflammations (e.g., bacterial and viral 
meningitis, encephalitis, and cerebral toxoplasmosis), cerebrovascular diseases (e.g., hypoxia, 
ischemia, and infarction, intracranial hemorrhage and vascular malformations, and 
hypertensive encephalopathy), and tumors (e.g., neuroglial tumors, neuronal tumors, tumors 
of pineal cells, meningeal tumors, primary and secondary lymphomas, intracranial tumors, 
and medulloblastoma), and to treat injury or trauma to the brain. 

INTERCEPT 258 nucleic acids, proteins, and modulators thereof can still further
be utilized to modulate development, differentiation, proliferation and/or activity of cells
involved in kidney or heart formation and function. In addition, such compositions and
modulators thereof can be utilized to ameliorate at least one symptom of disorders
associated with abnormal kidney or heart formation or function, including, but not limited
to nephritis, coronary disease, atherosclerosis and plaque formation. 

INTERCEPT 258 expression indicates that INTERCEPT 258 is involved, in 
addition to the above, in such processes as thermogenesis, adipocyte function, and
vascularization. As such, INTERCEPT 258 nucleic acids, proteins, and modulators
thereof can be utilized to modulate such processes as well as for ameliorating at least one
symptom associated with such processes. Such disorders include, but are not limited to 
obesity, regulation of body temperature, and disorders involving abnormal vascularization, 
e.g., vascularization of solid tumors. 

In further light of INTERCEPT 258 expression, as well as in light of its homology
to A33 antigen, INTERCEPT 258 nucleic acids, proteins and modulators thereof can be
utilized to modulate cell proliferation, including, for example, epithelial, e.g.,
gastrointestinal tract epithelial cell proliferation, and to ameliorate at least one symptom of
cell proliferative disorders such as cancer, and, in particular, chronic myelogenous
leukemia, colon cancers, small bowel epithelium cancers and other gastrointestinal tract
cancers. Further, INTERCEPT 258 expression can be utilized as a marker for specific
tissues (e.g., vascularized tissues) and/or cells (e.g., endothelial cells) in which
INTERCEPT 258 is expressed. INTERCEPT 258 nucleic acids can also be utilized for 
chromosomal mapping. 

Human TANGO 281 

A cDNA encoding human TANGO 281 was identified by analyzing the sequences of
clones present in a human megakaryocyte cDNA library. This analysis led to the identification
of a clone, AThPb81d10, encoding full-length human TANGO 281. The human TANGO 281
cDNA of this clone is 1812 nucleotides long (Figures 28A-28B; SEQ ID NO:46). The open
reading frame of this cDNA, nucleotides 65 to 799 of SEQ ID NO:46 (SEQ ID NO:47),
encodes a 245 amino acid transmembrane protein (Figures 28A-28B; SEQ ID NO:48).

The signal peptide prediction program SIGNALP (Nielsen, et al. (1997) Protein
Engineering 10:1-6) predicted that human TANGO 281 includes a 38 amino acid signal
peptide (amino acid 1 to amino acid 38 of SEQ ID NO:48; SEQ ID NO:49) preceding the
mature TANGO 281 protein (corresponding to amino acid 39 to amino acid 245 of SEQ ID
NO:48; SEQ ID NO:50). The molecular weight of TANGO 281 without post-translational
modifications is 26.5 kDa prior to the cleavage of the signal peptide, and 20.2 kDa after cleavage
of the signal peptide.

Human TANGO 281 is a transmembrane protein which contains one or more of the
following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a
cytoplasmic domain. The human TANGO 281 protein contains an extracellular domain at
amino acids 1 to 123 of SEQ ID NO:48 or a mature extracellular domain at about amino acid
residues 39 to 123 of SEQ ID NO:48 (SEQ ID NO:51), a transmembrane domain at amino
acid residues 124 to 148 of SEQ ID NO:48 (SEQ ID NO:52), and a cytoplasmic domain at
amino acid residues 149 to 245 of SEQ ID NO:48 (SEQ ID NO:53).
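
The domain boundaries quoted above can be checked for internal consistency. The Python sketch below, offered as an illustration rather than as part of the described method, verifies that the signal peptide, mature extracellular domain, transmembrane domain and cytoplasmic domain of human TANGO 281 tile residues 1 to 245 without gaps or overlaps. The helper names `segment_length` and `is_contiguous` are hypothetical; the coordinates are those stated in this specification.

```python
def segment_length(start: int, end: int) -> int:
    """Length of a segment given 1-based inclusive residue coordinates."""
    return end - start + 1


def is_contiguous(segments, protein_length: int) -> bool:
    """True if the ordered segments cover residues 1..protein_length exactly once."""
    expected_start = 1
    for _name, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


# Coordinates relative to SEQ ID NO:48, as stated in the text.
HUMAN_TANGO_281 = [
    ("signal peptide", 1, 38),
    ("mature extracellular domain", 39, 123),
    ("transmembrane domain", 124, 148),
    ("cytoplasmic domain", 149, 245),
]

for name, start, end in HUMAN_TANGO_281:
    print(f"{name}: residues {start}-{end} ({segment_length(start, end)} aa)")
print("contiguous:", is_contiguous(HUMAN_TANGO_281, 245))
```

The same check can be applied to any other set of domain coordinates by substituting the segment list and protein length.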

Figure 29 depicts a hydropathy plot of human TANGO 281. Relatively hydrophobic
regions of the protein are shown above the horizontal line, and relatively hydrophilic regions
of the protein are below the horizontal line. The cysteine residues (cys) and potential
N-glycosylation sites (Ngly) are indicated by short vertical lines just below the hydropathy
trace. The dashed vertical line separates the signal sequence (amino acids 1 to 38 of SEQ ID
NO:48; SEQ ID NO:49) on the left from the mature protein (amino acids 39 to 245 of SEQ
ID NO:48; SEQ ID NO:50) on the right.

Human TANGO 281 comprises photosystem II 10 kD phosphoprotein (PSBH)
domain sequences, which have been shown to be phosphorylated in a light-dependent
reaction, from amino acids 41 to 90 and 127 to 182 of SEQ ID NO:48 (SEQ ID NO:54 and
SEQ ID NO:55, respectively). Figure 30 depicts an alignment between the PSBH domain
(SEQ ID NO:69; Accession No. PF00737) and human TANGO 281 from amino acids 97 to
146 of SEQ ID NO:48. An N-glycosylation site having the sequence NTTT is present in
TANGO 281 at about amino acids 160 to 163 of SEQ ID NO:48. Two protein kinase C
phosphorylation sites are present in human TANGO 281. The first has the sequence SVR (at
amino acids 8 to 10 of SEQ ID NO:48), and the second has the sequence SSR (at amino acids
87 to 89 of SEQ ID NO:48). Three casein kinase II phosphorylation sites are present in
human TANGO 281. The first has the sequence SIPE (at amino acids 49 to 52 of SEQ ID
NO:48), the second has the sequence SCPD (at amino acids 53 to 56 of SEQ ID NO:48), and
the third has the sequence SSLD (at amino acids 108 to 111 of SEQ ID NO:48). Human
TANGO 281 has two N-myristoylation sites. The first has the sequence GSCSSQ (at amino
acids 60 to 65 of SEQ ID NO:48), and the second has the sequence GATVAI (at amino acids
119 to 124 of SEQ ID NO:48).

Nucleic acid base pairs 413 to 746 of human TANGO 281 (SEQ ID NO:46) have
81% identity to the nucleic acid sequence identified as Accession Number AV34245. Nucleic
acid base pairs 438 to 746 of human TANGO 281 (SEQ ID NO:46) have 80% identity to a 
nucleic acid sequence referred to as "gene 31" described in PCT Publication No. WO 
98/39446 (SEQ ID NO:70). "Gene 31" is characterized as being expressed primarily in brain 
and thymus, and to a lesser extent in such organs as liver, skin, bone and bone marrow. 

Clone EpT281 was deposited with the American Type Culture Collection (10801
University Boulevard, Manassas, VA 20110-2209) on April 21, 1999 and assigned Accession
Number 207222. This deposit will be maintained under the terms of the Budapest Treaty on
the International Recognition of the Deposit of Microorganisms for the Purposes of Patent
Procedure. This deposit was made merely as a convenience for those of skill in the art and
is not an admission that a deposit is required under 35 U.S.C. §112.

Mouse TANGO 281 

A cDNA encoding mouse TANGO 281 was identified in a normal mouse
megakaryocyte library by performing expression profiling on megakaryocytes obtained from
mice with a deletion of the element of the gata-1 gene responsible for
megakaryocyte-specific expression. This analysis led to the identification of a clone,
Atmea49d3, encoding full-length mouse TANGO 281. The mouse TANGO 281 cDNA of this
clone is 1858 nucleotides long (Figure 30; SEQ ID NO:56). The open reading frame of this
cDNA, nucleotides 90 to 728 of SEQ ID NO:56 (SEQ ID NO:57), encodes a 213 amino acid
transmembrane protein (Figure 30; SEQ ID NO:58).
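The open reading frame coordinates quoted above can be checked with simple arithmetic. The following Python sketch is illustrative only: the function name `orf_codon_count` is hypothetical, and the sketch assumes that the quoted positions are 1-based and inclusive and that the stated range excludes the stop codon, neither of which is stated explicitly in the text.

```python
# Coordinates as quoted in the text (assumed 1-based and inclusive).
ORF_START = 90
ORF_END = 728


def orf_codon_count(start: int, end: int) -> int:
    """Return the number of complete codons spanned by an inclusive range.

    Raises ValueError if the range is empty or not a multiple of three
    nucleotides, since such a range cannot be a whole reading frame.
    """
    length = end - start + 1
    if length <= 0:
        raise ValueError("end must not precede start")
    if length % 3 != 0:
        raise ValueError(f"range of {length} nt is not a whole number of codons")
    return length // 3


# 728 - 90 + 1 = 639 nucleotides, i.e. 213 codons, matching the
# 213 amino acid protein described in the text.
print(orf_codon_count(ORF_START, ORF_END))
```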

The signal peptide prediction program SIGNALP (Nielsen, et al. (1997) Protein
Engineering 10:1-6) predicted that mouse TANGO 281 includes a 26 amino acid signal
peptide (amino acid 1 to amino acid 26 of SEQ ID NO:58; SEQ ID NO:59) preceding the
mature TANGO 281 protein (corresponding to amino acid 27 to amino acid 213 of SEQ ID
NO:58; SEQ ID NO:60). The molecular weight of mouse TANGO 281 without
post-translational modifications is 22.9 kDa prior to the cleavage of the signal peptide
and 20.2 kDa after cleavage of the signal peptide.

Mouse TANGO 281 is a transmembrane protein which contains one or more of the
following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a
cytoplasmic domain. The mouse TANGO 281 protein contains an extracellular domain at
amino acid residues 27 to 112 of SEQ ID NO:58 (SEQ ID NO:61), a transmembrane domain
at amino acid residues 113 to 137 of SEQ ID NO:58 (SEQ ID NO:62), and a cytoplasmic
domain at amino acid residues 138 to 213 of SEQ ID NO:58 (SEQ ID NO:63).
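As a consistency check on the coordinates given for mouse TANGO 281, the sketch below verifies that the signal peptide and the three domains cover residues 1 to 213 without gaps or overlaps. It is a minimal illustration: the names `MOUSE_TANGO_281_REGIONS` and `tiles_exactly` are hypothetical, and the residue ranges are assumed to be 1-based and inclusive.

```python
from typing import List, Tuple

# (label, first residue, last residue), taken from the text above.
Region = Tuple[str, int, int]

MOUSE_TANGO_281_REGIONS: List[Region] = [
    ("signal peptide", 1, 26),
    ("extracellular domain", 27, 112),
    ("transmembrane domain", 113, 137),
    ("cytoplasmic domain", 138, 213),
]


def tiles_exactly(regions: List[Region], protein_length: int) -> bool:
    """Return True if the regions cover 1..protein_length with no gap or overlap."""
    expected_start = 1
    for _label, start, end in sorted(regions, key=lambda r: r[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == protein_length + 1


print(tiles_exactly(MOUSE_TANGO_281_REGIONS, 213))
```

The same check can be applied to any other set of domain boundaries by supplying a different region list and protein length.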

Figure 32 depicts a hydropathy plot of mouse TANGO 281. Relatively hydrophobic
regions of the protein are shown above the horizontal line, and relatively hydrophilic regions
of the protein are below the horizontal line. The cysteine residues (cys) and potential
N-glycosylation sites (Ngly) are indicated by short vertical lines just below the hydropathy
trace. The dashed vertical line separates the signal sequence (amino acids 1 to 26 of SEQ ID
NO:58; SEQ ID NO:59) on the left from the mature protein (amino acids 27 to 213 of SEQ
ID NO:58; SEQ ID NO:60) on the right.

Mouse TANGO 281 comprises photosystem II 10 kD phosphoprotein (PSBH) domain
sequences, which have been shown to be phosphorylated in a light-dependent reaction, from
amino acids 42 to 91 and 128 to 183 of SEQ ID NO:58 (SEQ ID NO:64 and SEQ ID NO:65,
respectively). Two N-glycosylation sites having the sequences NTTT (at amino acids 149
to 152 of SEQ ID NO:58) and NASS (at about amino acids 189 to 192 of SEQ ID NO:58) are
present in mouse TANGO 281. A glycosaminoglycan attachment site having the sequence SGFG
is present in mouse TANGO 281, and a protein kinase C phosphorylation site having the
sequence SSR is present in mouse TANGO 281. Two casein kinase II phosphorylation sites
are present in mouse TANGO 281. The first has the sequence TPAE (at amino acids 80 to
83 of SEQ ID NO:58), and the second has the sequence SSFD (at amino acids 97 to 100 of
SEQ ID NO:58). Mouse TANGO 281 has two N-myristylation sites. The first has the
sequence GSCSNQ (at amino acids 48 to 53 of SEQ ID NO:58), and the second has the
sequence GATVAI (at amino acids 108 to 113 of SEQ ID NO:58).
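Sites of the kinds listed above are commonly located by scanning a protein sequence for short consensus patterns. The text does not say which tool or patterns were used, so the sketch below is an assumption: it uses the widely cited PROSITE-style consensus patterns, rewritten as Python regular expressions, and the names `SITE_PATTERNS` and `find_sites` are hypothetical. Only the short motifs quoted in the text are used as inputs, since the full sequence is not reproduced here.

```python
import re
from typing import List, Tuple

# PROSITE-style consensus patterns expressed as regular expressions.
# These are assumed patterns, not ones stated in the document.
SITE_PATTERNS = {
    "N-glycosylation": r"N[^P][ST][^P]",
    "glycosaminoglycan attachment": r"SG.G",
    "protein kinase C phosphorylation": r"[ST].[RK]",
    "casein kinase II phosphorylation": r"[ST].{2}[DE]",
    "N-myristylation": r"G[^EDRKHPFYW].{2}[STAGCN][^P]",
}


def find_sites(sequence: str, pattern: str) -> List[Tuple[int, int, str]]:
    """Return (start, end, motif) for every match, overlaps included.

    Positions are 1-based and inclusive, matching the convention
    used for amino acid positions in the text.
    """
    lookahead = re.compile(f"(?=({pattern}))")
    return [
        (m.start() + 1, m.start() + len(m.group(1)), m.group(1))
        for m in lookahead.finditer(sequence)
    ]


# Each motif quoted for mouse TANGO 281 fits its assumed pattern.
quoted = {
    "N-glycosylation": ["NTTT", "NASS"],
    "glycosaminoglycan attachment": ["SGFG"],
    "protein kinase C phosphorylation": ["SSR"],
    "casein kinase II phosphorylation": ["TPAE", "SSFD"],
    "N-myristylation": ["GSCSNQ", "GATVAI"],
}
for site, motifs in quoted.items():
    for motif in motifs:
        print(site, motif, bool(re.fullmatch(SITE_PATTERNS[site], motif)))
```

A pattern match only flags a potential site; whether the residue is actually modified in vivo is a separate experimental question.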

Northern blot analysis of mouse TANGO 281 expression revealed two mRNA bands,
one of approximately 1.8 kb and another of approximately 1.4 kb. Expression of the 1.8 kb
band was detected in the heart, spleen, lung and kidney, with the greatest abundance detected
in the heart and lung, followed by the kidney and trace amounts in the spleen. Expression of
the 1.4 kb band was detected in the brain, spleen, and lung. Expression of the 1.4 kb and 1.8 kb
species of mouse TANGO 281 was detected in 7 day old normal mouse embryos. Neither the
1.4 kb nor the 1.8 kb species of mouse TANGO 281 was detected in 11 day old normal mouse
embryos. The 1.8 kb species of mouse TANGO 281 was detected in 15 day old normal
mouse embryos at 20% of the level detected in 7 day old normal mouse embryos. Expression
of the 1.8 kb species detected in 17 day old normal mouse embryos was comparable to the
level of expression detected in 7 day old normal mouse embryos. Expression of mouse
TANGO 281 was greatly reduced in megakaryocytes obtained from gata-1
knockout mice.

In situ tissue screening was performed on mouse adult and embryonic tissues to
analyze for the expression of mouse TANGO 281 mRNA. Mouse TANGO 281 expression
was detected predominantly in the adult lymphoid tissues such as the thymus, lymph node,
and spleen. In particular, mouse TANGO 281 expression was detected in the following adult
tissues: a moderate, ubiquitous signal was detected in the submandibular gland; a strong,
ubiquitous signal was detected in the adrenal gland; a strong, multifocal signal was detected
in the medulla of the thymus and a moderate, ubiquitous signal was detected in the cortex of
the thymus; a strong signal was detected in the lymph node; a strong signal was detected in
the follicles of the spleen; a weak signal was detected in the mucosal epithelium of the
bladder; a strong signal was detected in the ovaries; a ubiquitous signal was detected in the
placenta; a moderate signal was detected in the muscle region of the stomach; a weak signal
in a pattern outlining many of the large airways was detected in lung; a weak, ubiquitous
signal was detected in the liver; and a weak, ubiquitous signal was detected in the kidney.

In the case of embryonic expression, mouse TANGO 281 expression was detected in
the lung, stomach, thymus and submaxillary gland. In particular, at E16.5 a weak to moderate
signal was detected in the intestine and stomach, and a moderate, ubiquitous signal was
detected in the lung. At P1.5, a signal was detected in the lung, stomach, thymus, and
submaxillary gland.

Figure 33 shows that there is an overall 66.5% identity between the precursor human
TANGO 281 amino acid sequence and the precursor mouse TANGO 281 amino acid
sequence.
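A percent identity figure of this kind is derived from a pairwise alignment. The text does not state which alignment program or denominator convention was used, so the following is only a sketch under stated assumptions: the two sequences are already aligned to equal length with `-` as the gap character, and identity is counted over all aligned columns. The function name `percent_identity` is hypothetical. The example compares the N-myristylation motifs quoted in the text for human (GSCSSQ) and mouse (GSCSNQ) TANGO 281, not the full-length proteins.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Columns that are gaps in both sequences are ignored; a column with a
    gap in only one sequence counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Five of six positions agree between the human and mouse motifs.
print(round(percent_identity("GSCSSQ", "GSCSNQ"), 1))
```

Other conventions (for example, dividing by the length of the shorter sequence) give different values for the same alignment, which is why the convention matters when comparing reported figures.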

Clone EpT281 was deposited with the American Type Culture Collection (10801
University Boulevard, Manassas, VA 20110-2209) on June 15, 1999 and assigned patent
deposit Number PTA-224. This deposit will be maintained under the terms of the Budapest
Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes
of Patent Procedure. This deposit was made merely as a convenience for those of skill in the
art and is not an admission that a deposit is required under 35 U.S.C. §112.

Uses of TANGO 281 Nucleic Acids, Polypeptides, and Modulators Thereof

As TANGO 281 was originally found in a megakaryocyte library, TANGO 281
nucleic acids, proteins, and modulators thereof can be used to modulate the proliferation,
differentiation, and/or function of megakaryocytes and platelets. TANGO 281 nucleic acids,
proteins, and modulators thereof can be used to treat associated hematological diseases such
as thrombocytopenia, platelet disorders and bleeding disorders (e.g., hemophilia). TANGO
281 nucleic acids, proteins, and modulators thereof can be used to modulate platelet
aggregation and degranulation. Further, as TANGO 281 expression varies in mouse embryos
during development, TANGO 281 nucleic acids, proteins, and modulators thereof can be used
to modulate the development of cells, tissues or organs in embryos.

As TANGO 281 expression is greatly reduced in megakaryocytes obtained from gata-1
knockout mice compared to normal mice, TANGO 281 is either a direct or indirect target of
gata-1 and has profound biological implications. Gata-1 is a transcription factor involved in
the development of hematopoietic cell lineages; gata-1 expression is required for proper
development of erythrocytes and megakaryocytes. Although deletion of the gata-1 gene is
lethal at the embryonic stage due to a failure to form red blood cells, deletion of only the
element of the gata-1 gene responsible for megakaryocyte-specific expression (a 10 kb region
of genomic DNA containing a megakaryocyte specific DNase I hypersensitive site) is not lethal
and results in a reduction in gata-1 expression in the megakaryocyte without affecting gata-1
expression in red blood cells. The megakaryocytes of mice with this element of the gata-1
gene knocked out fail to develop into mature platelets, and the mice experience abnormal
bleeding due to their profound thrombocytopenia. TANGO 281 nucleic acids, proteins, and
modulators thereof can be used to treat diseases and/or disorders associated with gata-1
dysfunction. In light of the reduced expression of TANGO 281 in gata-1 knockout mice,
TANGO 281 expression can be utilized as a marker for modulators of gata-1 expression
and/or activity.

As TANGO 281 is expressed in the heart, brain, spleen, lung, kidney, embryo and
megakaryocytes, TANGO 281 nucleic acids, proteins, and modulators thereof can be used to
treat disorders of these cells, tissues, or organs, e.g., ischemic heart disease or atherosclerosis,
head trauma, brain cancer, splenic lymphoma, splenomegaly, lung cancer, cystic fibrosis,
rheumatoid lung disease, glomerulonephritis, end stage renal disease, uremia, DiGeorge
syndrome, thymoma, autoimmune disorders, atresia, Crohn's disease, and various embryonic
disorders. TANGO 281 nucleic acids, proteins, and modulators thereof can be used to
modulate the bleeding associated with uremia. Further, TANGO 281 nucleic acids, proteins,
and modulators thereof can be used to treat hypercoagulation associated with a damaged
endothelium, e.g., pre-eclampsia, malignant hypertension, disseminated intravascular
coagulopathy, renal transplant rejection, cyclosporin toxicity, microangiopathic hemolytic
anemia, and thrombotic thrombocytopenic purpura.

TANGO 281 exhibits homology to a gene referred to as "gene 31" (PCT Publication
No. WO 98/39446), which is expressed primarily in the brain and thymus. In light of this,
TANGO 281 nucleic acids, proteins and modulators thereof can be utilized to ameliorate at
least one symptom associated with central nervous system (CNS) disorders, hematopoietic
disorders, and disorders of the endocrine system.

Further, in light of TANGO 281's pattern of expression in mice, TANGO 281
expression can be utilized as a marker for specific tissues (e.g., lymphoid tissues such as the
thymus and spleen) and/or cells (e.g., lymphocytes) in which TANGO 281 is expressed.
TANGO 281 nucleic acids can also be utilized for chromosomal mapping.

Tables 1-4 below provide a summary of the sequence information for TANGO 253,
TANGO 257, INTERCEPT 258 and TANGO 281.



TABLE 1: Summary of Human TANGO 253, TANGO 257, INTERCEPT 258, and
TANGO 281 Sequence Information

Gene            cDNA           ORF            Figure          Accession Number
TANGO 253       SEQ ID NO:1    SEQ ID NO:2    Figure 1        207222
TANGO 257       SEQ ID NO:15   SEQ ID NO:16   Figures 9A-9B   207222
INTERCEPT 258   SEQ ID NO:26   SEQ ID NO:27   Figure 17       207222
TANGO 281       SEQ ID NO:46   SEQ ID NO:47   Figures 27      207222


Protein         Domain                 Location
TANGO 253       Signal Sequence        [entry illegible in source]
TANGO 253       Mature Protein         aa 16-243 of SEQ ID NO:3 (SEQ ID NO:4)
TANGO 253       Collagen               aa 36-45 of SEQ ID NO:3 (SEQ ID NO:6)
TANGO 253       C1q                    aa 102-232 of SEQ ID NO:3 (SEQ ID NO:7)
TANGO 257       Signal Sequence        [entry illegible in source]
TANGO 257       Mature Protein         aa 22-406 of SEQ ID NO:17 (SEQ ID NO:18)
INTERCEPT 258   Signal Sequence        aa 1-29 of SEQ ID NO:28 (SEQ ID NO:30)
INTERCEPT 258   Mature Protein         aa 30-370 of SEQ ID NO:28 (SEQ ID NO:29)
INTERCEPT 258   Extracellular          aa 30-206 of SEQ ID NO:28 (SEQ ID NO:76);
                                       aa 272-370 of SEQ ID NO:28 (SEQ ID NO:34)
INTERCEPT 258   [heading illegible]    aa 49-128; 167-226 of SEQ ID NO:28
                                       (SEQ ID NO:[illegible]; SEQ ID NO:36)
INTERCEPT 258   Transmembrane          aa 207-224 of SEQ ID NO:28 (SEQ ID NO:78);
                                       aa 247-271 of SEQ ID NO:28 (SEQ ID NO:33)
INTERCEPT 258   Cytoplasmic            aa 225-246 of SEQ ID NO:28 (SEQ ID NO:79)
TANGO 281       Signal Sequence        aa 1-38 of SEQ ID NO:48 (SEQ ID NO:49)
TANGO 281       Mature Protein         aa 39-245 of SEQ ID NO:48 (SEQ ID NO:50)
TANGO 281       Extracellular          aa 39-123 of SEQ ID NO:48 (SEQ ID NO:51)
TANGO 281       PSBH                   aa 41-90; 12-187 of SEQ ID NO:48
                                       (SEQ ID NO:54; SEQ ID NO:55)
TANGO 281       Transmembrane          aa 124-148 of SEQ ID NO:48 (SEQ ID NO:52)
TANGO 281       Cytoplasmic            aa 149-245 of SEQ ID NO:48 (SEQ ID NO:53)
TABLE 3: Summary of Mouse TANGO 253, TANGO 257, INTERCEPT 258 and
TANGO 281 Sequence Information

Gene            cDNA           ORF            Figure            Accession Number
TANGO 253       SEQ ID NO:8    SEQ ID NO:9    Figures 3A-3B     207215
TANGO 257       SEQ ID NO:21   SEQ ID NO:22   Figures 11A-11B   207217
INTERCEPT 258   SEQ ID NO:37   SEQ ID NO:38   Figures 20A-20B   207221
TANGO 281       SEQ ID NO:56   SEQ ID NO:57   Figures 31A-31B   PTA-224

Various aspects of the invention are described in further detail in the following subsections:
I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
a polypeptide of the invention or a biologically active portion thereof, as well as nucleic 
acid molecules sufficient for use as hybridization probes to identify nucleic acid molecules 
encoding a polypeptide of the invention and fragments of such nucleic acid molecules 
suitable for use as PCR primers for the amplification or mutation of nucleic acid 
molecules. As used herein, the term "nucleic acid molecule" is intended to include DNA 
molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs 
of the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can 
be single-stranded or double-stranded, but preferably is double-stranded DNA. In one 
embodiment, the nucleic acid molecules of the invention comprise a contiguous open 
reading frame encoding a polypeptide of the invention. 

An "isolated" nucleic acid molecule is one which is separated from other nucleic
acid molecules which are present in the natural source of the nucleic acid molecule.
Preferably, an "isolated" nucleic acid molecule is free of sequences (preferably protein
encoding sequences) which naturally flank the nucleic acid (i.e., sequences located at the
5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the
nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid
molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of
the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid
molecule, such as a cDNA molecule, can be substantially free of other cellular material, or
culture medium when produced by recombinant techniques, or substantially free of
chemical precursors or other chemicals when chemically synthesized. As used herein, the
term "isolated" when referring to a nucleic acid molecule does not include an isolated
chromosome.

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule
having the nucleotide sequence of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46,
47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123,
125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159,
161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180,
181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or a complement thereof,
can be isolated using standard molecular biology techniques and the sequence information
provided herein. Using all or a portion of the nucleic acid sequences of SEQ ID NO:1, 2,
8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109,
111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145,
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173,
174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191
or 192, as a hybridization probe, nucleic acid molecules of the invention can be isolated
using standard hybridization and cloning techniques (e.g., as described in Sambrook et al.,
eds., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989).

A nucleic acid molecule of the invention can be amplified using cDNA, mRNA or
genomic DNA as a template and appropriate oligonucleotide primers according to
standard PCR amplification techniques. The nucleic acid so amplified can be cloned into
an appropriate vector and characterized by DNA sequence analysis. Furthermore,
oligonucleotides corresponding to all or a portion of a nucleic acid molecule of the
invention can be prepared by standard synthetic techniques, e.g., using an automated DNA
synthesizer.

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
sequence of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80,
91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 
133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 
167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 
185, 186, 187, 188, 189, 190, 191 or 192, or the nucleotide sequence of the cDNA insert 
of a clone deposited with the ATCC® as Accession number 207222, Accession Number 
207215, Accession number 207217, Accession Number 207221 or patent deposit Number 
PTA-224, or a portion thereof. A nucleic acid molecule which is complementary to a 
given nucleotide sequence is one which is sufficiently complementary to the given 
nucleotide sequence that it can hybridize to the given nucleotide sequence thereby forming 
a stable duplex. 
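The notion of a complementary strand can be illustrated computationally. The sketch below computes the reverse complement of a DNA sequence and tests for perfect Watson-Crick complementarity; it is a simplification, since the text requires only that a sequence be sufficiently complementary to form a stable duplex, which depends on hybridization conditions that are not modeled here. The function names and the example sequence are hypothetical.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return sequence.translate(_COMPLEMENT)[::-1]


def is_perfect_complement(strand_a: str, strand_b: str) -> bool:
    """Return True if strand_b pairs base-for-base with strand_a.

    Both strands are written 5' to 3'. This tests only exact
    complementarity, a stricter criterion than duplex stability.
    """
    return reverse_complement(strand_a).upper() == strand_b.upper()


probe = "ATGGCC"  # illustrative sequence, not taken from the document
print(reverse_complement(probe))
print(is_perfect_complement(probe, "GGCCAT"))
```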

Moreover, a nucleic acid molecule of the invention can comprise only a portion of
a nucleic acid sequence encoding a full length polypeptide of the invention, for example, a
fragment which can be used as a probe or primer or a fragment encoding a biologically
active portion of a polypeptide of the invention. The nucleotide sequence determined from
the cloning of one gene allows for the generation of probes and primers designed for use in
identifying and/or cloning homologues in other cell types, e.g., from other tissues, as well
as homologues from other mammals. The probe/primer typically comprises substantially
purified oligonucleotide. In one embodiment, the oligonucleotide comprises a region of
nucleotide sequence that hybridizes under stringent conditions to at least about 12,
preferably 25, more preferably about 50, 75, 100, 125, 150, 175, 200, 250, 300, 350 or 400
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:1, 2, 8, 9,
15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111,
113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147,
149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174,
175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or
192, or the nucleotide sequence of the cDNA insert of a clone deposited with the ATCC®
as Accession Number 207222, Accession Number 207215, Accession Number 207217,
Accession Number 207221, or patent deposit Number PTA-224, or of a naturally
occurring mutant of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57,
77, 80, 91, 100, 101, 103, 104, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127,
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163,
165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182,
183, 184, 185, 186, 187, 188, 189, 190, 191 or 192. In another embodiment, the
oligonucleotide comprises a region of nucleotide sequence that hybridizes under stringent
conditions to at least 400, preferably 450, 500, 530, 550, 600, 700, 750, 800, 850, 900,
1000, 1100, 1200 or more consecutive nucleotides of the sense or antisense sequence
of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100,
101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135,
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168,
169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186,
187, 188, 189, 190, 191 or 192, or the nucleotide sequence of the cDNA insert of a clone
deposited with the ATCC® as Accession Number 207222, Accession Number 207215,
Accession Number 207217, Accession Number 207221, or patent deposit Number
PTA-224, or of a naturally occurring mutant of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27,
37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119,
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155,
157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178,
179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192.

In a preferred embodiment, the oligonucleotide typically comprises a region of
nucleotide sequence that hybridizes under stringent conditions to at least about 450,
preferably about 500, 550, 600, 650, 700, 750, 800, 850, 900, 1000, 1100 or 1300
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:1, 103, 105,
107 or 109, or a naturally occurring mutant of SEQ ID NO:1, 103, 105, 107, or 109. In
another preferred embodiment, the oligonucleotide typically comprises a region of
nucleotide sequence that hybridizes under stringent conditions to at least about 12,
preferably 25, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700 or 720
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:2, 91, 100, 101
or 80.

In another preferred embodiment, the oligonucleotide typically comprises a region
of nucleotide sequence that hybridizes under stringent conditions to at least about 540,
preferably about 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200 or 1250
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:8, 119, 121,
123 or 125, or of a naturally occurring mutant of SEQ ID NO:8, 119, 121, 123 or 125. In
another preferred embodiment, the oligonucleotide typically comprises a region of
nucleotide sequence that hybridizes under stringent conditions to at least about 310,
preferably about 350, 400, 450, 500, 550, 600, 650 or 700 consecutive nucleotides of the
sense or anti-sense sequence of SEQ ID NO:9, 174, 175, 176 or 177, or of a naturally
occurring mutant of SEQ ID NO:9, 174, 175, 176 or 177.

In another preferred embodiment, the oligonucleotide typically comprises a region
of nucleotide sequence that hybridizes under stringent conditions to at least about 1800
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:15, 111, 113,
115 or 117, or of a naturally occurring mutant of SEQ ID NO:15, 111, 113, 115 or 117. In
another preferred embodiment, the oligonucleotide typically comprises a region of
nucleotide sequence that hybridizes under stringent conditions to at least about 1150
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:16, 170, 171,
172 or 173, or of a naturally occurring mutant of SEQ ID NO:16, 170, 171, 172 or 173.

In another preferred embodiment, the oligonucleotide typically comprises a region
of nucleotide sequence that hybridizes under stringent conditions to at least about 1100,
preferably about 1200, 1300, 1400, 1500, 1600 or 1700 consecutive nucleotides of the
sense or anti-sense sequence of SEQ ID NO:21, 127, 129, 131 or 133, or of a naturally
occurring mutant of SEQ ID NO:21, 127, 129, 131 or 133. In another preferred
embodiment, the oligonucleotide typically comprises a region of nucleotide sequence that
hybridizes under stringent conditions to at least about 1150 consecutive nucleotides of the
sense or anti-sense sequence of SEQ ID NO:22, 178, 179, 180 or 181, or of a naturally
occurring mutant of SEQ ID NO:22, 178, 179, 180 or 181.

In another preferred embodiment, the oligonucleotide typically comprises a region
of nucleotide sequence that hybridizes under stringent conditions to at least about 420,
preferably about 450, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600,
1700 or 1800 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID
NO:26, 135, 137, 139 or 141, or of a naturally occurring mutant of SEQ ID NO:26, 135,
137, 139 or 141. In another preferred embodiment, the oligonucleotide typically
comprises a region of nucleotide sequence that hybridizes under stringent conditions to at
least about 12, preferably about 25, more preferably about 50, 100, 200, 300, 400, 500,
600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 consecutive
nucleotides of the sense or anti-sense sequence of SEQ ID NO:27, 182, 183, 184 or 185,
or of a naturally occurring mutant of SEQ ID NO:27, 182, 183, 184 or 185.

In another preferred embodiment, the oligonucleotide typically comprises a region
of nucleotide sequence that hybridizes under stringent conditions to at least about 675,
preferably about 700, 800, 900, 1000, 1200, 1300, 1400, 1500, 1600, 1700 or 1800
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:37, 143, 145,
147 or 149, or of a naturally occurring mutant of SEQ ID NO:37, 143, 145, 147 or 149. In
another preferred embodiment, the oligonucleotide typically comprises a region of
nucleotide sequence that hybridizes under stringent conditions to at least about 500,
preferably about 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or
1800 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:38, 186,
187, 188 or 189, or of a naturally occurring mutant of SEQ ID NO:38, 186, 187, 188 or
189.

In another preferred embodiment, the oligonucleotide typically comprises a region
of nucleotide sequence that hybridizes under stringent conditions to at least about 12,
preferably about 25, more preferably about 50, 100, 200, 300, 400, 500, 600, 700, 800,
900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 consecutive nucleotides of
the sense or anti-sense sequence of SEQ ID NO:46, 151, 153, 155 or 157, or of a naturally
occurring mutant of SEQ ID NO:46, 151, 153, 155 or 157. In another preferred
embodiment, the oligonucleotide typically comprises a region of nucleotide sequence that
hybridizes under stringent conditions to at least about 12, preferably about 25, more
preferably about 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300,
1400, 1500, 1600, 1700 or 1800 consecutive nucleotides of the sense or anti-sense
sequence of SEQ ID NO:47, 190, 191, 192 or 77, or of a naturally occurring mutant of
SEQ ID NO:47, 190, 191, 192 or 77.

In another preferred embodiment, the oligonucleotide typically comprises a region
of nucleotide sequence that hybridizes under stringent conditions to at least about 550,
preferably about 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700,
1800 or 1850 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID
NO:56, 159, 161, 163 or 165, or of a naturally occurring mutant of SEQ ID NO:56, 159,
161, 163 or 165. In another preferred embodiment, the oligonucleotide typically
comprises a region of nucleotide sequence that hybridizes under stringent conditions to at
least about 12, preferably about 25, more preferably about 50, 100, 200, 300, 400, 500,
600 or 700 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:57,
166, 167, 168 or 169, or of a naturally occurring mutant of SEQ ID NO:57, 166, 167, 168
or 169.

Probes based on the sequence of a nucleic acid molecule of the invention can be
used to detect transcripts or genomic sequences encoding the same protein molecule
encoded by a selected nucleic acid molecule. The probe comprises a label group attached
thereto, e.g., a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor.
Such probes can be used as part of a diagnostic test kit for identifying cells or tissues
which mis-express the protein, such as by measuring levels of a nucleic acid molecule
encoding the protein in a sample of cells from a subject, e.g., detecting mRNA levels or
determining whether a gene encoding the protein has been mutated or deleted.

A nucleic acid fragment encoding a biologically active portion of a polypeptide of
the invention can be prepared by isolating a portion of any of SEQ ID NO:3, 10, 17, 23,
28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130,
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164,
expressing the encoded portion of the polypeptide protein (e.g., by recombinant expression
in vitro) and assessing the activity of the encoded portion of the polypeptide.

The invention further encompasses nucleic acid molecules that differ from the
nucleotide sequence of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57,
77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127,
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163,
165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182,
183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or the nucleotide sequence of the
cDNA insert of a clone deposited with the ATCC® as Accession Number 207222,
Accession Number 207215, Accession Number 207217, Accession Number 207221 or
patent deposit Number PTA-224 due to degeneracy of the genetic code and thus encode the
same protein as that encoded by the nucleotide sequence of SEQ ID NO:1, 2, 8, 9, 15, 16,
21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113,
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149,
151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175,
176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or
the nucleotide sequence of the cDNA insert of a clone deposited with the ATCC® as
Accession Number 207222, Accession Number 207215, Accession Number 207217,
Accession Number 207221 or patent deposit Number PTA-224.
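Degeneracy of the genetic code means that different nucleotide sequences can encode the same polypeptide. The sketch below illustrates this with the standard genetic code; the two DNA sequences are hypothetical examples constructed for illustration (they are not sequences of the invention), chosen so that both encode the hexapeptide GSCSNQ mentioned earlier as an N-myristylation motif. The names `CODON_TABLE` and `translate` are likewise hypothetical.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order
# (first base varies slowest, third base fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon.

    Trailing nucleotides that do not form a complete codon are ignored.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Two hypothetical sequences that differ at several positions
# yet encode the same peptide because of synonymous codons.
variant_1 = "GGTTCTTGTTCTAACCAA"
variant_2 = "GGCAGCTGCAGCAATCAG"
print(translate(variant_1), translate(variant_2))
```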

In addition to the nucleotide sequences of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26,
27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117,
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153,
155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177,
178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, it will be
appreciated by those skilled in the art that DNA sequence polymorphisms that lead to
changes in the amino acid sequence may exist within a population (e.g., the human
population). Such genetic polymorphisms may exist among individuals within a
population due to natural allelic variation.

An allele is one of a group of genes which occur alternatively at a given genetic
locus. As used herein, the phrase "allelic variant" refers to a nucleotide sequence which
occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. As used
herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules
comprising an open reading frame encoding a polypeptide of the invention. Such natural
allelic variations can typically result in 1-5% variance in the nucleotide sequence of a
given gene. Alternative alleles can be identified by sequencing the gene of interest in a
number of different individuals. This can be readily carried out by using hybridization
probes to identify the same genetic locus in a variety of individuals. Any and all such
nucleotide variations and resulting amino acid polymorphisms or variations that are the
result of natural allelic variation and that do not alter the functional activity are intended to
be within the scope of the invention.

The human gene for TANGO 253 has been mapped to the long arm of
chromosome 11. Flanking markers for this region are D11S1356 and D11S924. The
JBS (Jacobsen syndrome), ED4 (ectodermal dysplasia 4), CMT4B (Charcot Marie Tooth
neuropathy), and PORC (porphyria, acute) loci also map to this region of the human
chromosome. The APOLP1 (apolipoprotein cluster), DRD2 (dopamine receptor 2),
PGL1 (paraganglioma glomus tumors), RDX (radixin), NCAM1 (neural cell adhesion
molecule), ARCN1 (archain 1), and IL-10R (IL-10 receptor) genes map to this region of
the human chromosome. This region is syntenic to mouse chromosome 9. The ruf (rough
fur), lu (luxoid), and atm (ataxia telangiectasia gene mutated in humans) loci also
map to this region of the mouse chromosome. The ruf (rough fur), lu (luxoid), hmbs
(hydroxymethylbilane synthase), IL-10Ra (IL-10 receptor α), and drd2 (dopamine
receptor 2) genes also map to this region of the mouse chromosome.

The human gene for TANGO 257 has been mapped to chromosome 1. Flanking
markers for this region are WI-7614 and FB14F9. The WS2B (Waardenburg syndrome)
locus also maps to this region of the human chromosome. The NGF-β (nerve growth
factor-β), TSHB (thyroid stimulating hormone), and GSTM1 (glutathione S-transferase
cluster) genes also map to this region of the human chromosome. This region is syntenic
to mouse chromosome 3. The de (droopy ear) locus maps to this region of the mouse
chromosome. The NGF-β (nerve growth factor-β), TSHB (thyroid stimulating hormone),
and BCAN (brevican) genes also map to this region of the mouse chromosome.

The human gene for INTERCEPT 258 has been mapped to the long arm of
chromosome 11, in the region q23. Flanking markers for this region are D11S936 and
D11S933. The CMT4B (Charcot Marie Tooth neuropathy), ED4 (ectodermal dysplasia),
JBS (Jacobsen syndrome), and TCPT (thrombocytopenia) loci also map to this region of
the human chromosome. The APOLP1 (apolipoprotein cluster), DRD2 (dopamine
receptor), and RDX (radixin) genes also map to this region of the human chromosome.
This region is syntenic to mouse chromosome 9. The atm (ataxia telangiectasia), ruf
(rough fur), and vs (variable spotting) loci map to this region of the mouse chromosome.
The lu (luxoid), vs (variable spotting), atm (ataxia telangiectasia), ruf (rough fur), and
lap1 (leucine arylaminopeptidase) genes also map to this region of the mouse
chromosome.

Moreover, nucleic acid molecules encoding proteins of the invention from other
species (homologues), which have a nucleotide sequence which differs from that of the
human or mouse protein described herein, are intended to be within the scope of the
invention. Nucleic acid molecules corresponding to natural allelic variants and
homologues of a cDNA of the invention can be isolated based on their identity to the
human nucleic acid molecule disclosed herein using the human cDNAs, or a portion
thereof, as a hybridization probe according to standard hybridization techniques under
stringent hybridization conditions. For example, a cDNA encoding a soluble form of a
membrane-bound protein of the invention can be isolated based on its hybridization to a
nucleic acid molecule encoding all or part of the membrane-bound form. Likewise, a cDNA
encoding a membrane-bound form can be isolated based on its hybridization to a nucleic
acid molecule encoding all or part of the soluble form.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 1000, 1100, 1200 or
1300 contiguous nucleotides in length and hybridizes under stringent conditions to the
nucleic acid molecule comprising the nucleotide sequence, preferably the coding
sequence, of SEQ ID NO:1, 103, 105, 107 or 109, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700 or
720 contiguous nucleotides in length and hybridizes under stringent conditions to the
nucleic acid molecule comprising the nucleotide sequence, preferably the coding
sequence, of SEQ ID NO:2, 80, 91, 100 or 101, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 540, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200 or 1250
contiguous nucleotides in length and hybridizes under stringent conditions to the nucleic
acid molecule comprising the nucleotide sequence, preferably the coding sequence, of
SEQ ID NO:8, 119, 121, 123 or 125, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 310, 350, 400, 450, 500, 550, 600, 650 or 700 contiguous nucleotides 
in length and hybridizes under stringent conditions to the nucleic acid molecule 
comprising the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:9, 
174, 175, 176 or 177, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 1800 contiguous nucleotides in length and hybridizes under stringent
conditions to the nucleic acid molecule comprising the nucleotide sequence, preferably the
coding sequence, of SEQ ID NO:15, 111, 113, 115 or 117, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 1150 or 1200 contiguous nucleotides in length and hybridizes under
stringent conditions to the nucleic acid molecule comprising the nucleotide sequence,
preferably the coding sequence, of SEQ ID NO:16, 170, 171, 172 or 173, or a complement
thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 1100, 1200, 1300, 1400, 1500, 1600 or 1700 contiguous nucleotides
in length and hybridizes under stringent conditions to the nucleic acid molecule
comprising the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:21,
127, 129, 131 or 133, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 1150 or 1200 contiguous nucleotides in length and hybridizes under
stringent conditions to the nucleic acid molecule comprising the nucleotide sequence,
preferably the coding sequence, of SEQ ID NO:22, 178, 179, 180 or 181, or a complement
thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 420, 450, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400,
1500, 1600, 1700, or 1800 contiguous nucleotides in length and hybridizes under stringent
conditions to the nucleic acid molecule comprising the nucleotide sequence, preferably the
coding sequence, of SEQ ID NO:26, 135, 137, 139 or 141, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200,
1300, 1400, 1500, 1600, 1700 or 1800 contiguous nucleotides in length and hybridizes
under stringent conditions to the nucleic acid molecule comprising the nucleotide
sequence, preferably the coding sequence, of SEQ ID NO:27, 182, 183, 184 or 185, or a
complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 675, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700
or 1800 contiguous nucleotides in length and hybridizes under stringent conditions to the
nucleic acid molecule comprising the nucleotide sequence, preferably the coding
sequence, of SEQ ID NO:37, 143, 145, 147 or 149, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600,
1700 or 1800 contiguous nucleotides in length and hybridizes under stringent conditions
to the nucleic acid molecule comprising the nucleotide sequence, preferably the coding
sequence, of SEQ ID NO:38, 186, 187, 188 or 189, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200,
1300, 1400, 1500, 1600, 1700 or 1800 contiguous nucleotides in length and hybridizes
under stringent conditions to the nucleic acid molecule comprising the nucleotide
sequence, preferably the coding sequence, of SEQ ID NO:46, 151, 153, 155 or 157, or a
complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 50, 100, 200, 300, 400, 500, 600, 700 or 750 contiguous nucleotides in
length and hybridizes under stringent conditions to the nucleic acid molecule comprising
the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:47, 77, 190, 191
or 192, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 550, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600,
1700, 1800 or 1850 contiguous nucleotides in length and hybridizes under stringent
conditions to the nucleic acid molecule comprising the nucleotide sequence, preferably the
coding sequence, of SEQ ID NO:56, 159, 161, 163 or 165, or a complement thereof.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the
invention is at least 50, 100, 200, 300, 400, 500, 600 or 700 contiguous nucleotides in
length and hybridizes under stringent conditions to the nucleic acid molecule comprising
the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:57, 166, 167, 168
or 169, or a complement thereof.

As used herein, the term "hybridizes under stringent conditions" is intended to
describe conditions for hybridization and washing under which nucleotide sequences at
least 60%, 65%, 70%, preferably 75%, identical to each other typically remain hybridized
to each other. Such stringent conditions are known to those skilled in the art and can be
found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989),
6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization conditions is
hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one
or more washes in 0.2X SSC, 0.1% SDS at 50-65°C. Preferably, an isolated nucleic acid
molecule of the invention that hybridizes under stringent conditions to the sequence of
SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101,
103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137,
139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169,
170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187,
188, 189, 190, 191 or 192, or a complement thereof, corresponds to a naturally-occurring
nucleic acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule
refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature
(e.g., encodes a natural protein).
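
As a worked illustration of the buffer strengths named in the example conditions, the sketch below converts an SSC dilution factor into molar concentrations. It assumes the conventional laboratory definition of 1X SSC as 0.15 M sodium chloride and 0.015 M sodium citrate, which this passage does not itself state; the function name `ssc_molarity` is hypothetical.

```python
# Assumed convention (not stated in the text): 1X SSC is
# 0.15 M sodium chloride and 0.015 M sodium citrate.
NACL_M_PER_X = 0.15
CITRATE_M_PER_X = 0.015


def ssc_molarity(fold):
    """Return (NaCl molarity, sodium citrate molarity) for an n-fold SSC buffer."""
    return round(fold * NACL_M_PER_X, 4), round(fold * CITRATE_M_PER_X, 4)


hybridization = ssc_molarity(6.0)  # 6X SSC used for hybridization
wash = ssc_molarity(0.2)           # 0.2X SSC used for the washes

print("6X SSC:", hybridization)
print("0.2X SSC:", wash)
```

Under this assumption the wash buffer carries thirty-fold less salt than the hybridization buffer, which is what makes the wash the more stringent step.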

In addition to naturally-occurring allelic variants of a nucleic acid molecule of the
invention that may exist in the population, the skilled artisan will further
appreciate that changes can be introduced by mutation thereby leading to changes in the
amino acid sequence of the encoded protein, without altering the biological activity of the
protein. For example, one can make nucleotide substitutions leading to amino acid
substitutions at "non-essential" amino acid residues. A "non-essential" amino acid residue
is a residue that can be altered from the wild-type sequence without altering the biological
activity, whereas an "essential" amino acid residue is required for biological activity. For
example, amino acid residues that are not conserved or only semi-conserved among
homologues of various species may be non-essential for activity and thus would be likely
targets for alteration. Specific examples of conservative amino acid alterations from the
original amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48 or 58 are shown in
SEQ ID NO:102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130,
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164.
Alternatively, amino acid residues that are conserved among the homologues of various
species (e.g., mouse and human) may be essential for activity and thus would not be likely
targets for alteration.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding a polypeptide of the invention that contain changes in amino acid residues that
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ
ID NO:3, 102, 104, 106 or 108, yet retain biological activity. In one embodiment, the
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that
includes an amino acid sequence that is at least about 40%, 45%, 50%, 55%, 60%, 65%,
75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:3, 102, 104,
106 or 108.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding a polypeptide of the invention that contain changes in amino acid residues that
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ
ID NO:10, 118, 120, 122 or 124, yet retain biological activity. In one embodiment, the
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that
includes an amino acid sequence that is at least about 95% or 98% identical to the amino
acid sequence of SEQ ID NO:10, 118, 120, 122 or 124.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding a polypeptide of the invention that contain changes in amino acid residues that
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ
ID NO:17, 110, 112, 114 or 116, yet retain biological activity. In one embodiment, the
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that
includes an amino acid sequence that is at least about 88%, 90%, 95% or 98% identical to
the amino acid sequence of SEQ ID NO:17, 110, 112, 114 or 116.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding a polypeptide of the invention that contain changes in amino acid residues that
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ
ID NO:23, 126, 128, 130 or 132, yet retain biological activity. In one embodiment, the
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that
includes an amino acid sequence that is at least about 88%, 90%, 95%, or 98% identical to
the amino acid sequence of SEQ ID NO:23, 126, 128, 130 or 132.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding a polypeptide of the invention that contain changes in amino acid residues that
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ
ID NO:28, 134, 136, 138 or 140, yet retain biological activity. In one embodiment, the
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that
includes an amino acid sequence that is at least about 45%, 50%, 55%, 60%, 65%, 75%,
85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:28, 134, 136, 138
or 140.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding a polypeptide of the invention that contain changes in amino acid residues that
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ
ID NO:39, 142, 144, 146 or 148, yet retain biological activity. In one embodiment, the
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that
includes an amino acid sequence that is at least about 45%, 50%, 55%, 60%, 65%, 75%,
85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:39, 142, 144, 146
or 148.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding a polypeptide of the invention that contain changes in amino acid residues that
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ
ID NO:48, 150, 152, 154 or 156, yet retain biological activity. In one embodiment, the
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that
includes an amino acid sequence that is at least about 30%, 35%, 40%, 45%, 50%, 55%,
60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID
NO:48, 150, 152, 154 or 156.

Accordingly, another aspect of the invention pertains to nucleic acid molecules
encoding a polypeptide of the invention that contain changes in amino acid residues that
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ
ID NO:58, 158, 160, 162 or 164, yet retain biological activity. In one embodiment, the
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that
includes an amino acid sequence that is at least about 30%, 35%, 40%, 45%, 50%, 55%,
60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID
NO:58, 158, 160, 162 or 164.
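
The percent-identity thresholds recited in the preceding embodiments can be illustrated with a minimal sketch. It assumes that the two amino acid sequences have already been aligned to equal length, with "-" marking gaps, and it computes identity as the number of identical positions divided by the total number of aligned positions, times 100. The sequences shown are hypothetical and the helper name `percent_identity` is an assumption; a real comparison would first align the sequences with an alignment program.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned sequences of equal length.

    Gap characters ("-") never count as identical positions.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical 20-residue reference and a variant with one substitution (K -> R).
reference = "MKTLLVAGAVLLSPAQAEDK"
variant = "MRTLLVAGAVLLSPAQAEDK"

identity = percent_identity(reference, variant)
print(f"{identity:.1f}% identical")
```

Under these assumptions the hypothetical variant would satisfy a 95% identity threshold but not a 98% threshold.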

An isolated nucleic acid molecule encoding a variant protein can be created by
introducing one or more nucleotide substitutions, additions or deletions into the nucleotide
sequence of SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80,
91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131,
133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166,
167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184,
185, 186, 187, 188, 189, 190, 191 or 192 such that one or more amino acid substitutions,
additions or deletions are introduced into the encoded protein. Mutations can be
introduced by standard techniques, such as site-directed mutagenesis and PCR-mediated
mutagenesis. Preferably, conservative amino acid substitutions are made at one or more
predicted non-essential amino acid residues. A "conservative amino acid substitution" is
one in which the amino acid residue is replaced with an amino acid residue having a
similar side chain. Families of amino acid residues having similar side chains have been
defined in the art. These families include amino acids with basic side chains (e.g., lysine,
arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar
side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine),
nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine,
methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and
aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Alternatively,
mutations can be introduced randomly along all or part of the coding sequence, such as by
saturation mutagenesis, and the resultant mutants can be screened for biological activity to
identify mutants that retain activity. Following mutagenesis, the encoded protein can be
expressed recombinantly and the activity of the protein can be determined.
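
The side-chain families listed above can be encoded directly as a lookup, as in the following sketch. It treats a substitution as conservative when the original and replacement residues share at least one of the listed families; this rule and the helper names `shared_families` and `is_conservative` are assumptions made for illustration, and residues are given in one-letter code.

```python
# Side-chain families as listed in the text; a residue may belong to
# more than one family (e.g., threonine, tyrosine, histidine).
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),                # lysine, arginine, histidine
    "acidic": set("DE"),                # aspartic acid, glutamic acid
    "uncharged polar": set("GNQSTYC"),  # glycine through cysteine
    "nonpolar": set("AVLIPFMW"),        # alanine through tryptophan
    "beta-branched": set("TVI"),        # threonine, valine, isoleucine
    "aromatic": set("YFWH"),            # tyrosine through histidine
}


def shared_families(original, replacement):
    """Names of the side-chain families that contain both residues."""
    return [
        name
        for name, members in SIDE_CHAIN_FAMILIES.items()
        if original in members and replacement in members
    ]


def is_conservative(original, replacement):
    """True if the two one-letter residues share at least one family."""
    return bool(shared_families(original, replacement))


print(is_conservative("K", "R"))  # lysine -> arginine
print(is_conservative("K", "D"))  # lysine -> aspartic acid
print(shared_families("V", "I"))  # valine -> isoleucine
```

Because the families overlap, a single substitution such as valine to isoleucine can be conservative under more than one family.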

In a preferred embodiment, a mutant polypeptide that is a variant of a polypeptide
of the invention can be assayed for: (1) the ability to form protein:protein interactions
with proteins in a signaling pathway of the polypeptide of the invention; (2) the ability to
bind a ligand of the polypeptide of the invention; or (3) the ability to bind to an
intracellular target protein of the polypeptide of the invention. In yet another preferred
embodiment, the mutant polypeptide can be assayed for the ability to modulate cellular
proliferation, cellular migration or chemotaxis, or cellular differentiation.

The present invention encompasses antisense nucleic acid molecules, i.e.,
molecules which are complementary to a sense nucleic acid encoding a polypeptide of the
invention, e.g., complementary to the coding strand of a double-stranded cDNA molecule
or complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can
hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be complementary
to an entire coding strand, or to only a portion thereof, e.g., all or part of the protein
coding region (or open reading frame). An antisense nucleic acid molecule can be
antisense to all or part of a non-coding region of the coding strand of a nucleotide
sequence encoding a polypeptide of the invention. The non-coding regions ("5' and 3'
untranslated regions") are the 5' and 3' sequences which flank the coding region and are
not translated into amino acids.
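
The complementarity described above can be sketched computationally as a reverse complement. The snippet below is illustrative only: the nine-nucleotide fragment is hypothetical, the helper names `antisense_dna` and `antisense_rna` are assumptions, and only the four unmodified bases are handled.

```python
# Watson-Crick pairing for DNA and RNA bases.
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")
_RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_dna(coding_strand):
    """Return the antisense strand (reverse complement), written 5' to 3'."""
    return coding_strand.upper().translate(_DNA_COMPLEMENT)[::-1]


def antisense_rna(mrna):
    """Return an RNA antisense to an mRNA, written 5' to 3'."""
    return mrna.upper().translate(_RNA_COMPLEMENT)[::-1]


# Hypothetical coding-strand fragment beginning with a start codon.
coding = "ATGGCCTTG"
print(antisense_dna(coding))       # antisense to the coding strand
print(antisense_rna("AUGGCCUUG"))  # antisense to the corresponding mRNA
```

Applying `antisense_dna` twice returns the original coding strand, which is a convenient sanity check on any such helper.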

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35,
40, 45 or 50 nucleotides or more in length. An antisense nucleic acid of the invention can
be constructed using chemical synthesis and enzymatic ligation reactions using procedures
known in the art. For example, an antisense nucleic acid (e.g., an antisense
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or
variously modified nucleotides designed to increase the biological stability of the
molecules or to increase the physical stability of the duplex formed between the antisense
and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted
nucleotides can be used. Examples of modified nucleotides which can be used to generate
the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil,
5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil,
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil,
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine,
1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine,
2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine,
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine,
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil,
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v),
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and
2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically
using an expression vector into which a nucleic acid has been subcloned in an antisense
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense
orientation to a target nucleic acid of interest, described further in the following
subsection).

The antisense nucleic acid molecules of the invention are typically administered to
a subject or generated in situ such that they hybridize with or bind to cellular mRNA
and/or genomic DNA encoding a selected polypeptide of the invention to thereby inhibit
expression, e.g., by inhibiting transcription and/or translation. The hybridization can be
by conventional nucleotide complementarity to form a stable duplex, or, for example, in
the case of an antisense nucleic acid molecule which binds to DNA duplexes, through
specific interactions in the major groove of the double helix. An example of a route of
administration of antisense nucleic acid molecules of the invention includes direct
injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified
to target selected cells and then administered systemically. For example, for systemic
administration, antisense molecules can be modified such that they specifically bind to
receptors or antigens expressed on a selected cell surface, e.g., by linking the antisense
nucleic acid molecules to peptides or antibodies which bind to cell surface receptors or
antigens. The antisense nucleic acid molecules can also be delivered to cells using the
vectors described herein. To achieve sufficient intracellular concentrations of the
antisense molecules, vector constructs in which the antisense nucleic acid molecule is
placed under the control of a strong pol II or pol III promoter are preferred.

An antisense nucleic acid molecule of the invention can be an α-anomeric
nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units,
the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res.
15:6625-6641). The antisense nucleic acid molecule can also comprise a
2'-O-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a
chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330).

The invention also encompasses ribozymes. Ribozymes are catalytic RNA
molecules with ribonuclease activity which are capable of cleaving a single-stranded
nucleic acid, such as an mRNA, to which they have a complementary region. Thus,
ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988)
Nature 334:585-591)) can be used to catalytically cleave mRNA transcripts to thereby
inhibit translation of the protein encoded by the mRNA. A ribozyme having specificity
for a nucleic acid molecule encoding a polypeptide of the invention can be designed based
upon the nucleotide sequence of a cDNA disclosed herein. For example, a derivative of a
Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the
active site is complementary to the nucleotide sequence to be cleaved. See Cech et al. U.S.
Patent No. 4,987,071; and Cech et al. U.S. Patent No. 5,116,742. Alternatively, an mRNA
encoding a polypeptide of the invention can be used to select a catalytic RNA having a
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel and
Szostak (1993) Science 261:1411-1418.

The invention also encompasses nucleic acid molecules which form triple helical
structures. For example, expression of a polypeptide of the invention can be inhibited by
targeting nucleotide sequences complementary to the regulatory region of the gene
encoding the polypeptide (e.g., the promoter and/or enhancer) to form triple helical
structures that prevent transcription of the gene in target cells. See generally Helene
(1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. N.Y. Acad. Sci. 660:27-36;
and Maher (1992) Bioessays 14(12):807-15.



WO 00/78808 



PCT/US00/16883 



In various embodiments, the nucleic acid molecules of the invention can be
modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the
stability, hybridization, or solubility of the molecule. For example, the deoxyribose
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic acids
(see Hyrup et al. (1996) Bioorganic & Medicinal Chemistry 4(1):5-23). As used herein,
the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA
mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide
backbone and only the four natural nucleobases are retained. The neutral backbone of
PNAs has been shown to allow for specific hybridization to DNA and RNA under
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996), supra;
Perry-O'Keefe et al. (1996) Proc. Natl. Acad. Sci. USA 93:14670-675.

PNAs can be used in therapeutic and diagnostic applications. For example, PNAs
can be used as antisense or antigene agents for sequence-specific modulation of gene
expression by, e.g., inducing transcription or translation arrest or inhibiting replication.
PNAs can also be used, e.g., in the analysis of single base pair mutations in a gene by, e.g.,
PNA directed PCR clamping; as artificial restriction enzymes when used in combination
with other enzymes, e.g., S1 nucleases (Hyrup (1996), supra); or as probes or primers for
DNA sequence and hybridization (Hyrup (1996), supra; Perry-O'Keefe et al. (1996) Proc.
Natl. Acad. Sci. USA 93:14670-675).

In another embodiment, PNAs can be modified, e.g., to enhance their stability or 
cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of 
PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras can be generated which may 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNAse H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 
of base stacking, number of bonds between the nucleobases, and orientation (Hyrup 
(1996), supra). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup (1996), supra, and Finn et al (1996) Nucleic Acids Res. 24(17):3357-63. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs. Compounds such 
as 5 f -(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite can be used as a link 
between the PNA and the 5' end of DNA (Mag et al. (1989) Nucleic Acids Res. 
17:5973-88). PNA monomers are then coupled in a stepwise manner to produce a 
chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al. (1996)
Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be
synthesized with a 5' DNA segment and a 3' PNA segment (Petersen et al. (1995)
Bioorganic Med. Chem. Lett. 5:1119-1124).

In other embodiments, the oligonucleotide may include other appended groups
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating
transport across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci.
USA 86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; PCT
Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT Publication No.
WO 89/10134). In addition, oligonucleotides can be modified with hybridization-triggered
cleavage agents (see, e.g., Krol et al. (1988) Bio/Techniques 6:958-976) or intercalating
agents (see, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide
may be conjugated to another molecule, e.g., a peptide, hybridization-triggered
cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc.


II. Isolated Proteins and Antibodies 

One aspect of the invention pertains to isolated proteins, and biologically active
portions thereof, as well as polypeptide fragments suitable for use as immunogens to raise
antibodies directed against a polypeptide of the invention. In one embodiment, the native
polypeptide can be isolated from cells or tissue sources by an appropriate purification
scheme using standard protein purification techniques. In another embodiment,
polypeptides of the invention are produced by recombinant DNA techniques. As an alternative
to recombinant expression, a polypeptide of the invention can be synthesized chemically
using standard peptide synthesis techniques.

An "isolated" or "purified" protein or biologically active portion thereof is
substantially free of cellular material or other contaminating proteins from the cell or
tissue source from which the protein is derived, or substantially free of chemical
precursors or other chemicals when chemically synthesized. The language "substantially
free of cellular material" includes preparations of protein in which the protein is separated
from cellular components of the cells from which it is isolated or recombinantly produced.
Thus, protein that is substantially free of cellular material includes preparations of protein
having less than about 30%, 20%, 10%, or 5% (by dry weight) of heterologous protein
(also referred to herein as a "contaminating protein"). When the protein or biologically
active portion thereof is recombinantly produced, it is also preferably substantially free of
culture medium, i.e., culture medium represents less than about 20%, 10%, or 5% of the
volume of the protein preparation. When the protein is produced by chemical synthesis, it
is preferably substantially free of chemical precursors or other chemicals, i.e., it is
separated from chemical precursors or other chemicals which are involved in the synthesis
of the protein. Accordingly, such preparations of the protein have less than about 30%,
20%, 10%, or 5% (by dry weight) of chemical precursors or compounds other than the
polypeptide of interest.

Biologically active portions of a polypeptide of the invention include polypeptides
comprising amino acid sequences sufficiently identical to or derived from the amino acid
sequence of the protein (e.g., the amino acid sequence shown in any of SEQ ID NO:4, 6,
7, 13, 14, 18, 23, 28, 33, 34, 35, 36, 39, 42, 44, 45, 48, 51, 52, 53, 54, 55, 58, 61, 62, 63,
64, 65, 71, 76, 34, 78, 79, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98,
or 99), which include fewer amino acids than the full length protein, and exhibit at least one
activity of the corresponding full-length protein. Typically, biologically active portions
comprise a domain or motif with at least one activity of the corresponding protein. A
biologically active portion of a protein of the invention can be a polypeptide which is, for
example, 10, 25, 50, 100 or more amino acids in length. Moreover, other biologically
active portions, in which other regions of the protein are deleted, can be prepared by
recombinant techniques and evaluated for one or more of the functional activities of the
native form of a polypeptide of the invention.


Preferred polypeptides have the amino acid sequence of SEQ ID NO:4, 6, 7, 13,
14, 18, 23, 28, 33, 34, 35, 36, 39, 42, 44, 45, 48, 51, 52, 53, 54, 55, 58, 61, 62, 63, 64, 65,
71, 76, 34, 78, 79, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, or 99.
Other useful proteins are substantially identical (e.g., at least about 45%, preferably 55%,
65%, 75%, 85%, 95%, or 99%) to any of SEQ ID NO:4, 6, 7, 13, 14, 18, 23, 28, 33, 34,
35, 36, 39, 42, 44, 45, 48, 51, 52, 53, 54, 55, 58, 61, 62, 63, 64, 65, 71, 76, 34, 78, 79, 81,
82, 83, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, or 99 and retain the functional
activity of the corresponding naturally-occurring protein yet differ in amino
acid sequence due to natural allelic variation or mutagenesis.

To determine the percent identity of two amino acid sequences or of two nucleic
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal
alignment with a second amino acid or nucleic acid sequence). The amino acid residues or
nucleotides at corresponding amino acid positions or nucleotide positions are then
compared. When a position in the first sequence is occupied by the same amino acid
residue or nucleotide as the corresponding position in the second sequence, then the
molecules are identical at that position. The percent identity between the two sequences is
a function of the number of identical positions shared by the sequences (i.e., % identity =
# of identical positions/total # of positions (e.g., overlapping positions) x 100). In one
embodiment, the two sequences are the same length.
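
By way of non-limiting illustration, the percent identity calculation described above can be sketched in Python as follows. The function name percent_identity, the use of "-" as the gap character, and the choice to count only positions at which both aligned sequences carry a residue as "overlapping positions" are assumptions made for this sketch; the sequences are assumed to have been aligned beforehand by any suitable method.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption for this sketch: "overlapping positions" are alignment
    columns in which neither sequence has a gap character.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    # Keep only the columns where both sequences have a residue.
    overlapping = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a != gap and b != gap
    ]
    if not overlapping:
        return 0.0

    # Only exact matches are counted as identical positions.
    identical = sum(1 for a, b in overlapping if a == b)
    return 100.0 * identical / len(overlapping)


if __name__ == "__main__":
    # Five of the six aligned positions match.
    print(round(percent_identity("ACDEFG", "ACDQFG"), 1))
```

If a different denominator is desired (for example, the full alignment length including gapped columns), only the construction of the overlapping list needs to change.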

The determination of percent identity between two sequences can be accomplished
using a mathematical algorithm. A preferred, non-limiting example of a mathematical
algorithm utilized for the comparison of two sequences is the algorithm of Karlin and
Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and
Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is
incorporated into the NBLAST and XBLAST programs of Altschul, et al. (1990) J. Mol.
Biol. 215:403-410. BLAST nucleotide searches can be performed with the NBLAST
program, score = 100, wordlength = 12 to obtain nucleotide sequences homologous to a
nucleic acid molecule of the invention. BLAST protein searches can be performed with
the XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences
homologous to a protein molecule of the invention. To obtain gapped alignments for
comparison purposes, Gapped BLAST can be utilized as described in Altschul et al.
(1997) Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be used to perform
an iterated search which detects distant relationships between molecules (Id.). When
utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters of the
respective programs (e.g., XBLAST and NBLAST) can be used. See
http://www.ncbi.nlm.nih.gov.

Another preferred, non-limiting example of a mathematical algorithm utilized for
the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989). Such
an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the
GCG sequence alignment software package. When utilizing the ALIGN program for
comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of
12, and a gap penalty of 4 can be used. Additional algorithms for sequence analysis are
known in the art and include ADVANCE and ADAM as described in Torelli and Robotti
(1994) Comput. Appl. Biosci. 10:3-5; and FASTA described in Pearson and Lipman
(1988) Proc. Natl. Acad. Sci. 85:2444-8. Within FASTA, ktup is a control option that sets
the sensitivity and speed of the search. If ktup=2, similar regions in the two sequences
being compared are found by looking at pairs of aligned residues; if ktup=1, single aligned
amino acids are examined. ktup can be set to 2 or 1 for protein sequences, or from 1 to 6
for DNA sequences. The default if ktup is not specified is 2 for proteins and 6 for DNA.
For a further description of FASTA parameters, see
http://bioweb.pasteur.fr/docs/man/man/fasta.1.html#sect2, the contents of which are
incorporated herein by reference.

The percent identity between two sequences can be determined using techniques
similar to those described above, with or without allowing gaps. In calculating percent
identity, only exact matches are counted.
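
By way of non-limiting illustration, the effect of the ktup option described above for FASTA can be sketched in Python as follows. The sketch is a simplified word-lookup step only, not the FASTA program itself: it indexes every word of length ktup in one sequence and reports the positions at which the same word occurs in the other. The function name shared_ktuples and the example sequences are hypothetical.

```python
from collections import defaultdict


def shared_ktuples(query: str, subject: str, ktup: int = 2) -> list[tuple[int, int]]:
    """Return (query_pos, subject_pos) pairs where the same ktup-length word occurs.

    Simplified illustration of word matching; scoring, diagonal joining and
    alignment steps of a real search program are deliberately omitted.
    """
    if ktup < 1:
        raise ValueError("ktup must be at least 1")

    # Index every word of length ktup in the query by its start position.
    index = defaultdict(list)
    for i in range(len(query) - ktup + 1):
        index[query[i:i + ktup]].append(i)

    # Scan the subject and collect every position pair that shares a word.
    hits = []
    for j in range(len(subject) - ktup + 1):
        for i in index.get(subject[j:j + ktup], ()):
            hits.append((i, j))
    return hits


if __name__ == "__main__":
    # Smaller ktup values give more word matches (more sensitive, slower).
    for k in (2, 1):
        print(k, len(shared_ktuples("MKTAYI", "MKAAYI", ktup=k)))
```

In this sketch a lower ktup yields more candidate matches to examine, which is consistent with ktup trading speed against sensitivity.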

The invention also provides chimeric or fusion proteins. As used herein, a
"chimeric protein" or "fusion protein" comprises all or part (preferably biologically active)
of a polypeptide of the invention operably linked to a heterologous polypeptide (i.e., a
polypeptide other than the same polypeptide of the invention). Within the fusion protein,
the term "operably linked" is intended to indicate that the polypeptide of the invention and
the heterologous polypeptide are fused in-frame to each other. The heterologous
polypeptide can be fused to the N-terminus or C-terminus of the polypeptide of the
invention.

One useful fusion protein is a GST fusion protein in which the polypeptide of the 
invention is fused to the C-terminus of GST sequences. Such fusion proteins can facilitate 
the purification of a recombinant polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal sequence
at its N-terminus. For example, the native signal sequence of a polypeptide of the
invention can be removed and replaced with a signal sequence from another protein. For
example, the gp67 secretory sequence of the baculovirus envelope protein can be used as a
heterologous signal sequence (Current Protocols in Molecular Biology, Ausubel et al.,
eds., John Wiley & Sons, 1992). Other examples of eukaryotic heterologous signal
sequences include the secretory sequences of melittin and human placental alkaline
phosphatase (Stratagene; La Jolla, California). In yet another example, useful prokaryotic
heterologous signal sequences include the phoA secretory signal (Sambrook et al., supra)
and the protein A secretory signal (Pharmacia Biotech; Piscataway, New Jersey).

In yet another embodiment, the fusion protein is an immunoglobulin fusion protein
in which all or part of a polypeptide of the invention is fused to sequences derived from a
member of the immunoglobulin protein family. The immunoglobulin fusion proteins of
the invention can be incorporated into pharmaceutical compositions and administered to a
subject to inhibit an interaction between a ligand (soluble or membrane-bound) and a
protein on the surface of a cell (receptor), to thereby suppress signal transduction in vivo.
The immunoglobulin fusion protein can be used to affect the bioavailability of a cognate
ligand of a polypeptide of the invention. Inhibition of ligand/receptor interaction may be
useful therapeutically, both for treating proliferative and differentiative disorders and for
modulating (e.g., promoting or inhibiting) cell survival. Moreover, the immunoglobulin
fusion proteins of the invention can be used as immunogens to produce antibodies directed
against a polypeptide of the invention in a subject, to purify ligands and in screening
assays to identify molecules which inhibit the interaction of receptors with ligands.

Chimeric and fusion proteins of the invention can be produced by standard
recombinant DNA techniques. In another embodiment, the fusion gene can be synthesized
by conventional techniques including automated DNA synthesizers. Alternatively, PCR
amplification of gene fragments can be carried out using anchor primers which give rise to
complementary overhangs between two consecutive gene fragments which can
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, e.g.,
Ausubel et al., supra). Moreover, many expression vectors are commercially available
that already encode a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a
polypeptide of the invention can be cloned into such an expression vector such that the
fusion moiety is linked in-frame to the polypeptide of the invention.

A signal sequence of a polypeptide of the invention (SEQ ID NO:5, 12, 19, 25, 30, 
41, 49 or 59) can be used to facilitate secretion and isolation of the secreted protein or 
other proteins of interest. Signal sequences are typically characterized by a core of 
hydrophobic amino acids which are generally cleaved from the mature protein during 
secretion in one or more cleavage events. Such signal peptides contain processing sites 
that allow cleavage of the signal sequence from the mature proteins as they pass through 
the secretory pathway. Thus, the invention pertains to the described polypeptides having a 
signal sequence, as well as to the signal sequence itself and to the polypeptide in the 
absence of the signal sequence (i.e., the cleavage products). In one embodiment, a nucleic
acid sequence encoding a signal sequence of the invention can be operably linked in an 
expression vector to a protein of interest, such as a protein which is ordinarily not secreted 
or is otherwise difficult to isolate. The signal sequence directs secretion of the protein, 
such as from a eukaryotic host into which the expression vector is transformed, and the 
signal sequence is subsequently or concurrently cleaved. The protein can then be readily 
purified from the extracellular medium by art-recognized methods. Alternatively, the
signal sequence can be linked to the protein of interest using a sequence which facilitates 
purification, such as with a GST domain. 

In another embodiment, the signal sequences of the present invention can be used
to identify regulatory sequences, e.g., promoters, enhancers, repressors. Since signal
sequences are the most amino-terminal sequences of a peptide, it is expected that the
nucleic acids which flank the signal sequence on its amino-terminal side will be regulatory
sequences which affect transcription. Thus, a nucleotide sequence which encodes all or a
portion of a signal sequence can be used as a probe to identify and isolate signal sequences
and their flanking regions, and these flanking regions can be studied to identify regulatory
elements therein.

The present invention also pertains to variants of the polypeptides of the invention.
Such variants have an altered amino acid sequence which can function as either agonists
(mimetics) or as antagonists. Variants can be generated by mutagenesis, e.g., discrete
point mutation or truncation. An agonist can retain substantially the same, or a subset, of
the biological activities of the naturally occurring form of the protein. An antagonist of a
protein can inhibit one or more of the activities of the naturally occurring form of the
protein by, for example, competitively binding to a downstream or upstream member of a
cellular signaling cascade which includes the protein of interest. Thus, specific biological
effects can be elicited by treatment with a variant of limited function. Treatment of a
subject with a variant having a subset of the biological activities of the naturally occurring
form of the protein can have fewer side effects in a subject relative to treatment with the
naturally occurring form of the protein.


Variants of a protein of the invention which function as either agonists (mimetics)
or as antagonists can be identified by screening combinatorial libraries of mutants, e.g.,
truncation mutants, of the protein of the invention for agonist or antagonist activity. In
one embodiment, a variegated library of variants is generated by combinatorial
mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A
variegated library of variants can be produced by, for example, enzymatically ligating a
mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of
potential protein sequences is expressible as individual polypeptides, or alternatively, as a
set of larger fusion proteins (e.g., for phage display). There are a variety of methods
which can be used to produce libraries of potential variants of the polypeptides of the
invention from a degenerate oligonucleotide sequence. Methods for synthesizing
degenerate oligonucleotides are known in the art (see, e.g., Narang (1983) Tetrahedron
39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science
198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477).
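
By way of non-limiting illustration, the set of discrete sequences represented by a degenerate oligonucleotide can be enumerated as sketched below in Python. The sketch assumes that the degenerate positions are written with the standard IUPAC nucleotide ambiguity codes, which the text above does not itself specify; the function name expand_degenerate and the example oligonucleotides are hypothetical.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (assumed notation for this sketch).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def expand_degenerate(oligo: str) -> list[str]:
    """List every discrete sequence encoded by a degenerate oligonucleotide."""
    try:
        choices = [IUPAC[base] for base in oligo.upper()]
    except KeyError as exc:
        raise ValueError(f"unsupported base code: {exc.args[0]!r}") from exc
    # The Cartesian product over positions gives the full degenerate set.
    return ["".join(bases) for bases in product(*choices)]


if __name__ == "__main__":
    print(expand_degenerate("ARY"))
    # An NNK codon covers 4 x 4 x 2 discrete triplets.
    print(len(expand_degenerate("NNK")))
```

The size of the expanded set grows multiplicatively with each degenerate position, which is why enumeration of this kind is practical only for short degenerate stretches.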

In addition, libraries of fragments of the coding sequence of a polypeptide of the
invention can be used to generate a variegated population of polypeptides for screening
and subsequent selection of variants. For example, a library of coding sequence fragments
can be generated by treating a double stranded PCR fragment of the coding sequence of
interest with a nuclease under conditions wherein nicking occurs only about once per
molecule, denaturing the double stranded DNA, renaturing the DNA to form double
stranded DNA which can include sense/antisense pairs from different nicked products,
removing single stranded portions from reformed duplexes by treatment with S1 nuclease,
and ligating the resulting fragment library into an expression vector. By this method, an
expression library can be derived which encodes N-terminal and internal fragments of
various sizes of the protein of interest.

Several techniques are known in the art for screening gene products of
combinatorial libraries made by point mutations or truncation, and for screening cDNA
libraries for gene products having a selected property. The most widely used techniques,
which are amenable to high throughput analysis, for screening large gene libraries
typically include cloning the gene library into replicable expression vectors, transforming
appropriate cells with the resulting library of vectors, and expressing the combinatorial
genes under conditions in which detection of a desired activity facilitates isolation of the
vector encoding the gene whose product was detected. Recursive ensemble mutagenesis
(REM), a technique which enhances the frequency of functional mutants in the libraries,
can be used in combination with the screening assays to identify variants of a protein of
the invention (Arkin and Youvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815;
Delagrave et al. (1993) Protein Engineering 6(3):327-331).

The polypeptides of the invention can exhibit post-translational modifications,
including, but not limited to, glycosylations (e.g., N-linked or O-linked glycosylations),
myristylations, palmitylations, acetylations and phosphorylations (e.g., serine/threonine or
tyrosine). In one embodiment, the TANGO 253, TANGO 257, INTERCEPT 258 or
TANGO 281 polypeptides of the invention exhibit reduced levels of O-linked
glycosylation and/or N-linked glycosylation relative to endogenously expressed TANGO
253, TANGO 257, INTERCEPT 258 or TANGO 281 polypeptides. In another embodiment,
the TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281 polypeptides of the
invention do not exhibit O-linked glycosylation or N-linked glycosylation. The post-translational
modifications of TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281
polypeptides will vary depending upon the host cell in which TANGO 253, TANGO
257, INTERCEPT 258 or TANGO 281 is expressed. Further, post-translational
modifications of TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281
polypeptides such as glycosylation can be prevented by treating cells, e.g., with
tunicamycin.

An isolated polypeptide of the invention, or a fragment thereof, can be used as an
immunogen to generate antibodies using standard techniques for polyclonal and
monoclonal antibody preparation. The full-length polypeptide or protein can be used or,
alternatively, the invention provides antigenic peptide fragments for use as immunogens.
In one embodiment, an isolated polypeptide or fragment thereof which lacks N- and/or
O-linked glycosylation is used as an immunogen to generate antibodies using standard
techniques known to those of skill in the art. The antigenic peptide of a protein of the
invention comprises at least 8 (preferably 10, 15, 20, or 30) amino acid residues of the
amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110,
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146,
148, 150, 152, 154, 156, 158, 160, 162 or 164 and encompasses an epitope of the protein
such that an antibody raised against the peptide forms a specific immune complex with the
protein.

Preferred epitopes encompassed by the antigenic peptide are regions that are
located on the surface of the protein, e.g., hydrophilic regions. Figures 2, 4, 10, 12, 19, 21,
29 and 32 are hydropathy plots of the proteins of the invention. These plots or similar
analyses can be used to identify hydrophilic regions.
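
By way of non-limiting illustration, a hydropathy analysis of the kind referred to above can be sketched in Python as follows. The sketch assumes the Kyte-Doolittle hydropathy scale and a sliding window of nine residues; the text above does not state which scale or window was used for the figures, so both are assumptions, as are the function names hydropathy_profile and hydrophilic_windows and the threshold of zero.

```python
# Kyte-Doolittle hydropathy values (assumed scale; positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 9) -> list[float]:
    """Mean hydropathy of every window-length stretch of the sequence."""
    sequence = sequence.upper()
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in sequence]  # KeyError on non-standard residues
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def hydrophilic_windows(sequence: str, window: int = 9, threshold: float = 0.0):
    """Start positions (0-based) and scores of windows below the threshold."""
    return [
        (start, round(score, 2))
        for start, score in enumerate(hydropathy_profile(sequence, window))
        if score < threshold
    ]


if __name__ == "__main__":
    # Hypothetical peptide: a charged stretch followed by a hydrophobic one.
    print(hydrophilic_windows("KKKDEEIIILLVV", window=5))
```

Windows with strongly negative mean values correspond to hydrophilic stretches, which are candidate surface-exposed regions for use as antigenic peptides.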

An immunogen typically is used to prepare antibodies by immunizing a suitable
subject (e.g., a rabbit, goat, mouse or other mammal). An appropriate immunogenic
preparation can contain, for example, recombinantly expressed or chemically synthesized 
polypeptide. The preparation can further include an adjuvant, such as Freund's complete 
or incomplete adjuvant, or similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains to antibodies directed
against a polypeptide of the invention. The term "antibody" as used herein refers to
immunoglobulin molecules and immunologically active portions of immunoglobulin
molecules, i.e., molecules that contain an antigen binding site which specifically binds an
antigen, such as a polypeptide of the invention, e.g., an epitope of a polypeptide of the
invention. A molecule which specifically binds to a given polypeptide of the invention is
a molecule which binds the polypeptide, but does not substantially bind other molecules in
a sample, e.g., a biological sample, which naturally contains the polypeptide. Examples of
immunologically active portions of immunoglobulin molecules include F(ab) and F(ab')2
fragments which can be generated by treating the antibody with an enzyme such as pepsin.
The invention provides polyclonal and monoclonal antibodies. The term "monoclonal
antibody" or "monoclonal antibody composition", as used herein, refers to a population of
antibody molecules that contain only one species of an antigen binding site capable of
immunoreacting with a particular epitope.

Polyclonal antibodies can be prepared as described above by immunizing a
suitable subject with a polypeptide of the invention as an immunogen. Preferred
polyclonal antibody compositions are ones that have been selected for antibodies directed
against a polypeptide or polypeptides of the invention. Particularly preferred polyclonal
antibody preparations are ones that contain only antibodies directed against a polypeptide
or polypeptides of the invention. Particularly preferred immunogen compositions are
those that contain no other human proteins such as, for example, immunogen
compositions made using a non-human host cell for recombinant expression of a
polypeptide of the invention. In such a manner, the only human epitope or epitopes
recognized by the resulting antibody compositions raised against this immunogen will be
present as part of a polypeptide or polypeptides of the invention.

The antibody titer in an immunized subject can be monitored over time by standard
techniques, such as with an enzyme linked immunosorbent assay (ELISA) using
immobilized polypeptide. If desired, the antibody molecules can be isolated from the
mammal (e.g., from the blood) and further purified by well-known techniques, such as
protein A chromatography to obtain the IgG fraction. Alternatively, antibodies specific
for a protein or polypeptide of the invention can be selected for (e.g., partially purified) or
purified by, e.g., affinity chromatography. For example, a recombinantly expressed and
purified (or partially purified) protein of the invention is produced as described herein, and
covalently or non-covalently coupled to a solid support such as, for example, a
chromatography column. The column can then be used to affinity purify antibodies
specific for the proteins of the invention from a sample containing antibodies directed
against a large number of different epitopes, thereby generating a substantially purified
antibody composition, i.e., one that is substantially free of contaminating antibodies. By a
substantially purified antibody composition is meant, in this context, that the antibody
sample contains at most only 30% (by dry weight) of contaminating antibodies directed
against epitopes other than those on the desired protein or polypeptide of the invention,
and preferably at most 20%, yet more preferably at most 10%, and most preferably at most
5% (by dry weight) of the sample is contaminating antibodies. A purified antibody
composition means that at least 99% of the antibodies in the composition are directed
against the desired protein or polypeptide of the invention.

At an appropriate time after immunization, e.g., when the specific antibody titers
are highest, antibody-producing cells can be obtained from the subject and used to prepare
monoclonal antibodies by standard techniques, such as the hybridoma technique originally
described by Kohler and Milstein (1975) Nature 256:495-497, the human B cell
hybridoma technique (Kozbor et al. (1983) Immunol. Today 4:72), the EBV-hybridoma
technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy, Alan R. Liss,
Inc., pp. 77-96) or trioma techniques. The technology for producing hybridomas is well
known (see generally Current Protocols in Immunology (1994) Coligan et al. (eds.) John
Wiley & Sons, Inc., New York, NY). Hybridoma cells producing a monoclonal antibody
of the invention are detected by screening the hybridoma culture supernatants for
antibodies that bind the polypeptide of interest, e.g., using a standard ELISA assay.

As an alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal
antibody directed against a polypeptide of the invention can be identified and isolated by
screening a recombinant combinatorial immunoglobulin library (e.g., an antibody phage
display library) with the polypeptide of interest. Kits for generating and screening phage
display libraries are commercially available (e.g., the Pharmacia Recombinant Phage
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage Display
Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly
amenable for use in generating and screening antibody display libraries can be found in, for
example, U.S. Patent No. 5,223,409; PCT Publication No. WO 92/18619; PCT
Publication No. WO 91/17271; PCT Publication No. WO 92/20791; PCT Publication No.
WO 92/15679; PCT Publication No. WO 93/01288; PCT Publication No. WO 92/01047;
PCT Publication No. WO 92/09690; PCT Publication No. WO 90/02809; Fuchs et al.
(1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum. Antibod. Hybridomas
3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffiths et al. (1993) EMBO J.
12:725-734.

Additionally, recombinant antibodies, such as chimeric and humanized
monoclonal antibodies, comprising both human and non-human portions, which can be
made using standard recombinant DNA techniques, are within the scope of the invention.
A chimeric antibody is a molecule in which different portions are derived from different
animal species, such as those having a variable region derived from a murine mAb and a
human immunoglobulin constant region. (See, e.g., Cabilly et al., U.S. Patent No.
4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are incorporated herein by
reference in their entirety.) Humanized antibodies are antibody molecules from non-human
species having one or more complementarity determining regions (CDRs) from
the non-human species and a framework region from a human immunoglobulin molecule.
(See, e.g., Queen, U.S. Patent No. 5,585,089, which is incorporated herein by reference in
its entirety.) Such chimeric and humanized monoclonal antibodies can be produced by
recombinant DNA techniques known in the art, for example using methods described in
PCT Publication No. WO 87/02671; European Patent Application 184,187; European
Patent Application 171,496; European Patent Application 173,494; PCT Publication No.
WO 86/01533; U.S. Patent No. 4,816,567; European Patent Application 125,023; Better et
al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc. Natl. Acad. Sci. USA
84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et al. (1987) Proc. Natl.
Acad. Sci. USA 84:214-218; Nishimura et al. (1987) Canc. Res. 47:999-1005; Wood et al.
(1985) Nature 314:446-449; Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559;
Morrison (1985) Science 229:1202-1207; Oi et al. (1986) Bio/Techniques 4:214; U.S.
Patent 5,225,539; Jones et al. (1986) Nature 321:522-525; Verhoeyen et al. (1988) Science
239:1534; and Beidler et al. (1988) J. Immunol. 141:4053-4060.

Completely human antibodies are particularly desirable for therapeutic treatment
of human patients. Such antibodies can be produced, for example, using transgenic mice
which are incapable of expressing endogenous immunoglobulin heavy and light chain
genes, but which can express human heavy and light chain genes. The transgenic mice are
immunized in the normal fashion with a selected antigen, e.g., all or a portion of a
polypeptide of the invention. Monoclonal antibodies directed against the antigen can be
obtained using conventional hybridoma technology. The human immunoglobulin
transgenes harbored by the transgenic mice rearrange during B cell differentiation, and
subsequently undergo class switching and somatic mutation. Thus, using such a
technique, it is possible to produce therapeutically useful IgG, IgA and IgE antibodies.
For an overview of this technology for producing human antibodies, see Lonberg and
Huszar (1995, Int. Rev. Immunol. 13:65-93). For a detailed discussion of this technology
for producing human antibodies and human monoclonal antibodies and protocols for
producing such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. Patent 5,633,425; U.S.
Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent 5,545,806. In addition,
companies such as Abgenix, Inc. (Fremont, CA), can be engaged to provide human
antibodies directed against a selected antigen using technology similar to that described
above.

Completely human antibodies which recognize a selected epitope can be generated 
using a technique referred to as "guided selection." In this approach a selected non-human 
monoclonal antibody, e.g., a mouse antibody, is used to guide the selection of a
completely human antibody recognizing the same epitope (Jespers et al. (1994)
Bio/technology 12:899-903).

An antibody directed against a polypeptide of the invention (e.g., monoclonal
antibody) can be used to isolate the polypeptide by standard techniques, such as affinity
chromatography or immunoprecipitation. Moreover, such an antibody can be used to
detect the protein (e.g., in a cellular lysate or cell supernatant) in order to evaluate the
abundance and pattern of expression of the polypeptide. The antibodies can also be used
diagnostically to monitor protein levels in tissue as part of a clinical testing procedure,
e.g., to determine the efficacy of a given treatment regimen. Detection can
be facilitated by coupling the antibody to a detectable substance. Examples of detectable
substances include various enzymes, prosthetic groups, fluorescent materials, luminescent
materials, bioluminescent materials, and radioactive materials. Examples of suitable
enzymes include horseradish peroxidase, alkaline phosphatase, beta-galactosidase, or
acetylcholinesterase; examples of suitable prosthetic group complexes include
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material
is luminol; examples of bioluminescent materials include luciferase, luciferin, and
aequorin; and examples of suitable radioactive material include 125I, 131I, 35S or 3H.

Further, an antibody (or fragment thereof) can be conjugated to a therapeutic 
moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin or 
cytotoxic agent includes any agent that is detrimental to cells. Examples include taxol, 
cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, 
teniposide, vincristine, vinblastine, colchicine, doxorubicin, daunorubicin, dihydroxy
anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone,
glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and analogs
or homologs thereof. Therapeutic agents include, but are not limited to, antimetabolites
(e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil,
dacarbazine), alkylating agents (e.g., mechlorethamine, thiotepa, chlorambucil, melphalan,
carmustine (BCNU) and lomustine (CCNU), cyclophosphamide, busulfan,
dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II) 
(DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and 
doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, 
mithramycin, and anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and 
vinblastine). 

The conjugates of the invention can be used for modifying a given biological
response; the drug moiety is not to be construed as limited to classical chemical
therapeutic agents. For example, the drug moiety may be a protein or polypeptide
possessing a desired biological activity. Such proteins may include, for example, a toxin
such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as tumor
necrosis factor, α-interferon, β-interferon, nerve growth factor, platelet derived growth
factor, tissue plasminogen activator; or biological response modifiers such as, for
example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6
("IL-6"), granulocyte macrophage colony stimulating factor ("GM-CSF"), granulocyte
colony stimulating factor ("G-CSF"), or other growth factors.

Techniques for conjugating a therapeutic moiety to antibodies are well known, see, 
e.g., Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer 
Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 
243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug Delivery", in
Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 (Marcel Dekker,
Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A
Review", in Monoclonal Antibodies '84: Biological And Clinical Applications, Pinchera et
al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future Prospective Of The
Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in Monoclonal
Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp. 303-16 
(Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic Properties Of 
Antibody-Toxin Conjugates", Immunol. Rev., 62:119-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an
antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980.

Accordingly, in one aspect, the invention provides substantially purified antibodies
or fragments thereof, including human, non-human, chimeric, and humanized antibodies,
which antibodies or fragments specifically bind to a polypeptide comprising an amino acid
sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110,
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146,
148, 150, 152, 154, 156, 158, 160, 162 or 164, or an amino acid sequence encoded by the
cDNA insert of a clone deposited with the ATCC® as Accession Number 207222,
Accession Number 207215, Accession Number 207217, Accession Number 207221, or
patent deposit Number PTA-224, or a complement thereof. In another aspect, the
invention provides substantially purified antibodies or fragments thereof, including
human, non-human, chimeric and humanized antibodies, which antibodies or fragments
thereof specifically bind to a polypeptide comprising a fragment of at least 8 contiguous
amino acid residues, preferably at least 15 contiguous amino acid residues, of the amino
acid sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108,
110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144,
146, 148, 150, 152, 154, 156, 158, 160, 162, or 164.
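
As a rough illustration of what "a fragment of at least 8 contiguous amino acid residues" covers, the following Python sketch enumerates every contiguous window of a chosen length within a sequence. The function name and the 12-residue peptide are hypothetical placeholders introduced only for this example; the peptide is not any of the SEQ ID NOs recited above.

```python
from typing import List


def contiguous_fragments(sequence: str, length: int) -> List[str]:
    """Return every contiguous fragment of exactly `length` residues.

    A sequence shorter than `length` yields an empty list.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Hypothetical peptide, for illustration only (not a SEQ ID NO).
example_peptide = "MKTLLVAGSCQR"
fragments_of_8 = contiguous_fragments(example_peptide, 8)
for fragment in fragments_of_8:
    print(fragment)
```

A sequence of n residues contains n - 8 + 1 such 8-residue fragments, so the number of candidate fragments grows linearly with the length of the polypeptide.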

In another aspect, the invention provides substantially purified antibodies or
fragments thereof, including human, non-human, chimeric and humanized antibodies,
which antibodies or fragments thereof specifically
bind to a polypeptide comprising an amino acid sequence which is at least 95% identical
to the amino acid sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102,
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138,
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, wherein the percent
identity is determined using the ALIGN program of the GCG software package with a
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4. In
another aspect, the invention provides substantially purified antibodies or fragments
thereof, including human, non-human, chimeric and humanized antibodies, which
antibodies or fragments thereof specifically bind to a polypeptide comprising an
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to the
nucleic acid molecule consisting of any one of SEQ ID NOs:1, 2, 8, 9, 15, 16, 21, 22, 26,
27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117,
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153,
155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177,
178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or the cDNA
insert of a clone deposited with the ATCC® as Accession Number 207222, Accession Number
207215, Accession Number 207217, Accession Number 207221 or patent deposit Number
PTA-224, or a complement thereof, under conditions of hybridization of 6X SSC at 45°C
and washing in 0.2 X SSC, 0.1% SDS at 50°C, 55°C, 60°C or 65°C.
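
The percent identity threshold recited above can be made concrete with a short Python sketch. It assumes that a pairwise alignment has already been produced (for example by an alignment program) and is supplied as two equal-length strings with "-" marking gaps. It does not reimplement the ALIGN program, the PAM120 table or the gap penalties, and the choice to count gapped columns in the denominator is an assumption made for illustration. The function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a precomputed pairwise alignment.

    Columns in which both sequences carry a gap are ignored; every other
    column counts toward the total (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" and b == "-":
            continue  # column carries no residue in either sequence
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("alignment contains no comparable columns")
    return 100.0 * identical / compared


# Hypothetical aligned fragments, for illustration only.
identity = percent_identity("MKT-LV", "MKTALI")
meets_threshold = identity >= 95.0
print(round(identity, 2), meets_threshold)
```

In this made-up example four of six columns match, so the pair falls well below the 95% threshold.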

In various embodiments, the substantially purified antibodies or fragments thereof 
of the invention are polyclonal, monoclonal, Fab fragments, single chain antibodies, or
F(ab')2 fragments. The non-human antibodies or fragments thereof of the invention can be
goat, mouse, sheep, horse, chicken, rabbit or rat antibodies or antibody fragments. In a
preferred embodiment, the antibodies of the invention are monoclonal antibodies that 
specifically bind to a polypeptide of the invention. 

The substantially purified antibodies or fragments thereof specifically bind to a
signal peptide, a secreted sequence, an extracellular domain, a transmembrane domain or a
cytoplasmic domain of a polypeptide of the invention. In a
particularly preferred embodiment, the substantially purified antibodies or fragments
thereof of the invention specifically bind to a secreted sequence or an extracellular domain
of the amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106,
108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142,
144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or the amino acid sequence
encoded by the EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or
EpTm281 cDNA insert of ATCC® Accession Number 207222, Accession Number
207215, Accession Number 207217, Accession Number 207221, or patent deposit
Number PTA-224, or a complement thereof. In one embodiment, the extracellular domain
to which the antibody or antibody fragment binds comprises at least 8 contiguous amino
acid residues, preferably at least 10 or at least 15 contiguous amino acid residues, of
amino acid residues 30 to 206 of SEQ ID NO:28 (SEQ ID NO:76), amino acid residues
272 to 370 of SEQ ID NO:28 (SEQ ID NO:34), amino acid residues 30 to 249 of SEQ ID
NO:39 (SEQ ID NO:83), amino acid residues 39 to 123 of SEQ ID NO:48 (SEQ ID
NO:50), or amino acid residues 27 to 112 of SEQ ID NO:58 (SEQ ID NO:61).

Any of the antibodies of the invention can be conjugated to a therapeutic moiety or
to a detectable substance. Non-limiting examples of detectable substances that can be 
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 
fluorescent material, a luminescent material, a bioluminescent material, and a radioactive 
material. 

The invention also provides a kit containing an antibody of the invention 
conjugated to a detectable substance, and instructions for use. Still another aspect of the 
invention is a pharmaceutical composition comprising an antibody of the invention and a 
pharmaceutically acceptable carrier. In preferred embodiments, the pharmaceutical
composition contains an antibody of the invention, a therapeutic moiety, and a
pharmaceutically acceptable carrier.

Still another aspect of the invention is a method of making an antibody that
specifically recognizes TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281,
the method comprising immunizing a mammal with a polypeptide. In one embodiment,
the polypeptide used as an immunogen comprises an amino acid sequence of any one of
SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118,
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162 or 164, or an amino acid sequence encoded by the cDNA insert of a 
clone deposited with the ATCC® as Accession Number 207222, Accession Number 
207215, Accession Number 207217, Accession Number 207221, or patent deposit 
Number PTA-224. In another embodiment, the polypeptide used as an immunogen 
comprises a fragment of at least 15 amino acid residues, preferably at least 25 amino acid 
residues, of the amino acid sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 
58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 
136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or an amino 
acid sequence which is at least 85%, preferably at least 95% identical to the amino acid 
sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110,
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162 or 164, wherein the percent identity is determined 
using the ALIGN program of the GCG software package with a PAM120 weight residue 
table, a gap length penalty of 12, and a gap penalty of 4. 

In another embodiment, the polypeptide used as an immunogen comprises an
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to the
nucleic acid molecule consisting of any one of SEQ ID NOs:1, 2, 8, 9, 15, 16, 21, 22, 26,
27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117,
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153,
155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177,
178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or the cDNA
insert of a clone deposited with the ATCC® as Accession Number 207222, Accession
Number 207215, Accession Number 207217, Accession Number 207221, or patent
deposit Number PTA-224, or a complement thereof, under conditions of hybridization of
6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 50°C, 55°C, 60°C or 65°C.
After immunization, a sample is collected from the mammal that contains an antibody that
specifically recognizes TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281, a
fragment thereof, or allelic variant thereof. Preferably, the polypeptide is recombinantly
produced using a non-human host cell. Optionally, the antibodies can be further purified
from the sample using techniques well known to those of skill in the art. The method can
further comprise producing a monoclonal antibody-producing cell from the cells of the
mammal. Optionally, antibodies are collected from the antibody-producing cell.

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors,
containing a nucleic acid encoding a polypeptide of the invention (or a portion thereof).
As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting
another nucleic acid to which it has been linked. One type of vector is a "plasmid", which
refers to a circular double stranded DNA loop into which additional DNA segments can be
ligated. Another type of vector is a viral vector, wherein additional DNA segments can be
ligated into the viral genome. Certain vectors are capable of autonomous replication in a
host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal
mammalian vectors) are integrated into the genome of a host cell upon introduction into
the host cell, and thereby are replicated along with the host genome. Moreover, certain
vectors, expression vectors, are capable of directing the expression of genes to which they
are operably linked. In general, expression vectors of utility in recombinant DNA
techniques are often in the form of plasmids (vectors). However, the invention is intended
to include such other forms of expression vectors, such as viral vectors (e.g., replication
defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent
functions.

The recombinant expression vectors of the invention comprise a nucleic acid of the
invention in a form suitable for expression of the nucleic acid in a host cell. This means
that the recombinant expression vectors include one or more regulatory sequences,
selected on the basis of the host cells to be used for expression, which are operably linked to
the nucleic acid sequence to be expressed. Within a recombinant expression vector,
"operably linked" is intended to mean that the nucleotide sequence of interest is linked to
the regulatory sequence(s) in a manner which allows for expression of the nucleotide
sequence (e.g., in an in vitro transcription/translation system or in a host cell when the
vector is introduced into the host cell). The term "regulatory sequence" is intended to
include promoters, enhancers and other expression control elements (e.g., polyadenylation
signals). Such regulatory sequences are described, for example, in Goeddel, Gene
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA
(1990). Regulatory sequences include those which direct constitutive expression of a
nucleotide sequence in many types of host cell and those which direct expression of the
nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences).
It will be appreciated by those skilled in the art that the design of the expression vector can
depend on such factors as the choice of the host cell to be transformed, the level of
expression of protein desired, etc. The expression vectors of the invention can be
introduced into host cells to thereby produce proteins or peptides, including fusion
proteins or peptides, encoded by nucleic acids as described herein.

The recombinant expression vectors of the invention can be designed for 
expression of a polypeptide of the invention in prokaryotic (e.g., E. coli) or eukaryotic
cells (e.g., insect cells (using baculovirus expression vectors), yeast cells or mammalian
cells). Suitable host cells are discussed further in Goeddel, supra. Alternatively, the 
recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 2) 
to increase the solubility of the recombinant protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 
their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
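
The role of the proteolytic cleavage site at the fusion junction can be sketched in Python as a simple motif search. The recognition motifs and cut positions below (IEGR for Factor Xa, LVPRGS for thrombin, DDDDK for enterokinase) are commonly cited values supplied here as assumptions, not taken from this specification, and the tag and target sequences are made up for illustration.

```python
from typing import List

# Assumed motifs: (recognition sequence, number of motif residues left
# on the N-terminal side of the cut). Illustrative values only.
PROTEASE_SITES = {
    "Factor Xa": ("IEGR", 4),      # assumed cut after the Arg
    "thrombin": ("LVPRGS", 4),     # assumed cut between Arg and Gly
    "enterokinase": ("DDDDK", 5),  # assumed cut after the Lys
}


def find_cleavage_points(protein: str, protease: str) -> List[int]:
    """Return the index of the first residue after each predicted cut."""
    motif, offset = PROTEASE_SITES[protease]
    points = []
    start = protein.find(motif)
    while start != -1:
        points.append(start + offset)
        start = protein.find(motif, start + 1)
    return points


def release_target(fusion: str, protease: str) -> str:
    """Return the C-terminal product of cutting at the first predicted site."""
    points = find_cleavage_points(fusion, protease)
    if not points:
        raise ValueError("no recognition site found for " + protease)
    return fusion[points[0]:]


# Hypothetical fusion: N-terminal tag + thrombin site + target protein.
fusion_protein = "MSPILGYWK" + "LVPRGS" + "MKTLLVAG"
print(release_target(fusion_protein, "thrombin"))
```

Because the fusion moiety is usually at the amino terminus, the sketch returns the portion downstream of the first site. A real design would also check that the target protein itself contains no internal recognition motif.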


Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and 
Johnson (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5
(Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose E
binding protein, or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene Expression
Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990)
60-89). Target gene expression from the pTrc vector relies on host RNA polymerase
transcription from a hybrid trp-lac fusion promoter. Target gene expression from the pET
11d vector relies on transcription from a T7 gn10-lac fusion promoter mediated by a
coexpressed viral RNA polymerase (T7 gn1). This viral polymerase is supplied by host
strains BL21(DE3) or HMS174(DE3) from a resident λ prophage harboring a T7 gn1 gene
under the transcriptional control of the lacUV5 promoter.

One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in a host bacterium with an impaired capacity to proteolytically cleave the
recombinant protein (Gottesman, Gene Expression Technology: Methods in Enzymology
185, Academic Press, San Diego, California (1990) 119-128). Another strategy is to alter
the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so
that the individual codons for each amino acid are those preferentially utilized in E. coli
(Wada et al. (1992) Nucleic Acids Res. 20:2111-2118). Such alteration of nucleic acid
sequences of the invention can be carried out by standard DNA synthesis techniques. 
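
The codon alteration strategy can be illustrated by back-translating a protein sequence with one chosen codon per amino acid. The Python sketch below is a minimal example under stated assumptions: the codon table is an illustrative choice of codons often described as frequently used in E. coli, not the table of Wada et al., and the function name and peptide are hypothetical. A real design would draw on a measured codon usage table for the intended host strain.

```python
# Illustrative one-codon-per-residue table (an assumption for this sketch;
# not taken from the cited reference). "*" denotes a stop codon.
ASSUMED_PREFERRED_CODONS = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
    "*": "TAA",
}


def back_translate(protein: str) -> str:
    """Encode a protein using the single assumed codon for each residue."""
    codons = []
    for residue in protein.upper():
        if residue not in ASSUMED_PREFERRED_CODONS:
            raise ValueError("unsupported residue: " + residue)
        codons.append(ASSUMED_PREFERRED_CODONS[residue])
    return "".join(codons)


# Hypothetical peptide followed by a stop, for illustration only.
coding_sequence = back_translate("MKTLLVAG*")
print(coding_sequence)
```

The encoded amino acid sequence is unchanged by this substitution; only the synonymous codon choice differs, which is why the altered sequence can be produced by standard DNA synthesis.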

In another embodiment, the expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerevisiae include pYepSec1 (Baldari et al.
(1987) EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943),
pJRY88 (Schultz et al. (1987) Gene 54:113-123), pYES2 (Invitrogen Corporation, San
Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA). 

Alternatively, the expression vector is a baculovirus expression vector. 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., Sf9
cells) include the pAc series (Smith et al. (1983) Mol. Cell. Biol. 3:2156-2165) and the
pVL series (Luckow and Summers (1989) Virology 170:31-39).

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 329:840) and pMT2PC 
(Kaufman et al. (1987) EMBO J. 6:187-195). When used in mammalian cells, the
expression vector's control functions are often provided by viral regulatory elements. For
example, commonly used promoters are derived from polyoma, Adenovirus 2,
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook et al., supra. 

In another embodiment, the recombinant mammalian expression vector is capable
of directing expression of the nucleic acid preferentially in a particular cell type (e.g.,
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific
regulatory elements are known in the art. Non-limiting examples of suitable
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al.
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988)
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and
Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916),
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No.
4,873,316 and European Application Publication No. 264,166).
Developmentally-regulated promoters are also encompassed, for example the mouse hox
promoters (Kessel and Gruss (1990) Science 249:374-379) and the α-fetoprotein
promoter (Camper and Tilghman (1989) Genes Dev. 3:537-546).

The invention further provides a recombinant expression vector comprising a DNA
molecule of the invention cloned into the expression vector in an antisense orientation.
That is, the DNA molecule is operably linked to a regulatory sequence in a manner which
allows for expression (by transcription of the DNA molecule) of an RNA molecule which
is antisense to the mRNA encoding a polypeptide of the invention. Regulatory
sequences operably linked to a nucleic acid cloned in the antisense orientation can be
chosen which direct the continuous expression of the antisense RNA molecule in a variety
of cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be
chosen which direct constitutive, tissue specific or cell type specific expression of
antisense RNA. The antisense expression vector can be in the form of a recombinant
plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under
the control of a high efficiency regulatory region, the activity of which can be determined
by the cell type into which the vector is introduced. For a discussion of the regulation of
gene expression using antisense genes see Weintraub et al. (Reviews - Trends in Genetics,
Vol. 1(1) 1986).
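
The relationship between an mRNA and the antisense RNA transcribed from an insert cloned in the antisense orientation is the reverse complement. The following Python sketch shows that relationship for a short, made-up RNA fragment; the function name is hypothetical, and only the four standard RNA bases are handled.

```python
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense_rna(mrna: str) -> str:
    """Return the reverse complement of an mRNA, written 5' to 3'."""
    try:
        complement = [RNA_COMPLEMENT[base] for base in mrna.upper()]
    except KeyError as err:
        raise ValueError("unsupported base: " + str(err)) from None
    # Reversing keeps the conventional 5'-to-3' orientation of the result.
    return "".join(reversed(complement))


# Hypothetical mRNA fragment, for illustration only.
print(antisense_rna("AUGGCC"))
```

Taking the reverse complement twice returns the original sequence, which gives a quick consistency check on any antisense sequence design.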

Another aspect of the invention pertains to host cells into which a recombinant
expression vector of the invention has been introduced. The terms "host cell" and
"recombinant host cell" are used interchangeably herein. It is understood that such terms
refer not only to the particular subject cell but to the progeny or potential progeny of such
a cell. Because certain modifications may occur in succeeding generations due to either
mutation or environmental influences, such progeny may not, in fact, be identical to the
parent cell, but are still included within the scope of the term as used herein.

A host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., insect
cells, yeast or mammalian cells).

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation. Suitable methods for transforming or transfecting host 
cells can be found in Sambrook, et al. (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the
expression vector and transfection technique used, only a small fraction of cells may
integrate the foreign DNA into their genome. In order to identify and select these
integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is
generally introduced into the host cells along with the gene of interest. Preferred
selectable markers include those which confer resistance to drugs, such as G418,
hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid
can be identified by drug selection (e.g., cells that have incorporated the selectable marker
gene will survive, while the other cells die).

In another embodiment, the expression characteristics of an endogenous gene (e.g.,
TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 genes) within a cell, cell
line or microorganism may be modified by inserting a DNA regulatory element
heterologous to the endogenous gene of interest into the genome of a cell, stable cell line
or cloned microorganism such that the inserted regulatory element is operatively linked
with the endogenous gene (e.g., TANGO 253, TANGO 257, INTERCEPT 258 and
TANGO 281 genes) and controls, modulates or activates the endogenous gene. For
example, endogenous
TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 genes which are normally
"transcriptionally silent", i.e., TANGO 253, TANGO 257, INTERCEPT 258 and
TANGO 281 genes which are normally not expressed, or are expressed only at very low
levels in a cell line or microorganism, may be activated by inserting a regulatory element
which is capable of promoting the expression of a normally expressed gene product in that
cell line or microorganism. Alternatively, transcriptionally silent, endogenous TANGO
253, TANGO 257, INTERCEPT 258 and TANGO 281 genes may be activated by
insertion of a promiscuous regulatory element that works across cell types.

A heterologous regulatory element may be inserted into a stable cell line or cloned 
microorganism, such that it is operatively linked with endogenous TANGO 253, TANGO
257, INTERCEPT 258 and TANGO 281 genes, using techniques, such as targeted
homologous recombination, which are well known to those of skill in the art, and
described, e.g., in Chappel, U.S. Patent No. 5,272,071; PCT publication No. WO
91/06667, published May 16, 1991. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 
can be used to produce a polypeptide of the invention. Accordingly, the invention further 
provides methods for producing a polypeptide of the invention using the host cells of the 
invention. In one embodiment, the method comprises culturing the host cell of the invention
(into which a recombinant expression vector encoding a polypeptide of the invention has 
been introduced) in a suitable medium such that the polypeptide is produced. In another 
embodiment, the method further comprises isolating the polypeptide from the medium or 
the host cell. 

The host cells of the invention can also be used to produce nonhuman transgenic
animals. For example, in one embodiment, a host cell of the invention is a fertilized
oocyte or an embryonic stem cell into which a sequence encoding a polypeptide of the
invention has been introduced. Such host cells can then be used to create non-human
transgenic animals in which exogenous sequences encoding a polypeptide of the invention
have been introduced into their genome or homologous recombinant animals in which
endogenous sequences encoding a polypeptide of the invention have been altered. Such
animals are useful for studying the function and/or activity of the polypeptide and for
identifying and/or evaluating modulators of polypeptide activity. As used herein, a
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a
rodent such as a rat or mouse, in which one or more of the cells of the animal includes a
transgene. Other examples of transgenic animals include non-human primates, sheep,
dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA which is
integrated into the genome of a cell from which a transgenic animal develops and which
remains in the genome of the mature animal, thereby directing the expression of an
encoded gene product in one or more cell types or tissues of the transgenic animal. As
used herein, an "homologous recombinant animal" is a non-human animal, preferably a
mammal, more preferably a mouse, in which an endogenous gene has been altered by
homologous recombination between the endogenous gene and an exogenous DNA
molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior
to development of the animal.

A transgenic animal of the invention can be created by introducing nucleic acid 
encoding a polypeptide of the invention (or a homologue thereof) into the male pronuclei 
of a fertilized oocyte, e.g., by microinjection or retroviral infection, and allowing the oocyte 
to develop in a pseudopregnant female foster animal. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase the efficiency of 
expression of the transgene. A tissue-specific regulatory sequence(s) can be operably 
linked to the transgene to direct expression of the polypeptide of the invention to particular 
cells. Methods for generating transgenic animals via embryo manipulation and 
microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. Patent No. 
4,873,191 and in Hogan, Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986) and Wakayama et al. (1999) Proc. 
Natl. Acad. Sci. USA 96:14984-14989. Similar methods are used for production of other 
transgenic animals. A transgenic founder animal can be identified based upon the 
presence of the transgene in its genome and/or expression of mRNA encoding the 
transgene in tissues or cells of the animals. A transgenic founder animal can then be used 
to breed additional animals carrying the transgene. Moreover, transgenic animals carrying 
the transgene can further be bred to other transgenic animals carrying other transgenes. 

To create an homologous recombinant animal, a vector is prepared which contains 
at least a portion of a gene encoding a polypeptide of the invention into which a deletion, 
addition or substitution has been introduced to thereby alter, e.g., functionally disrupt, the 
gene. In a preferred embodiment, the vector is designed such that, upon homologous 
recombination, the endogenous gene is functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" vector). Alternatively, the vector can 
be designed such that, upon homologous recombination, the endogenous gene is mutated 
or otherwise altered but still encodes functional protein (e.g., the upstream regulatory 
region can be altered to thereby alter the expression of the endogenous protein). In the 
homologous recombination vector, the altered portion of the gene is flanked at its 5' and 3' 
ends by additional nucleic acid of the gene to allow for homologous recombination to 
occur between the exogenous gene carried by the vector and an endogenous gene in an 
embryonic stem cell. The additional flanking nucleic acid sequences are of sufficient 
length for successful homologous recombination with the endogenous gene. Typically, 
several kilobases of flanking DNA (both at the 5' and 3' ends) are included in the vector 
(see, e.g., Thomas and Capecchi (1987) Cell 51:503 for a description of homologous 
recombination vectors). The vector is introduced into an embryonic stem cell line (e.g., by 
electroporation) and cells in which the introduced gene has homologously recombined 
with the endogenous gene are selected (see, e.g., Li et al. (1992) Cell 69:915). The 
selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to form 
aggregation chimeras (see, e.g., Bradley in Teratocarcinomas and Embryonic Stem Cells: 
A Practical Approach, Robertson, ed. (IRL, Oxford, 1987) pp. 113-152). A chimeric 
embryo can then be implanted into a suitable pseudopregnant female foster animal and the 
embryo brought to term. Progeny harboring the homologously recombined DNA in their 
germ cells can be used to breed animals in which all cells of the animal contain the 
homologously recombined DNA by germline transmission of the transgene. Methods for 
constructing homologous recombination vectors and homologous recombinant animals are 
described further in Bradley (1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos. WO 90/11354, WO 91/01140, WO 92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage P1. For a 
description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. Natl. 
Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 
251:1351-1355). If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut et al. (1997) Nature 385:810-813 and PCT 
Publication Nos. WO 97/07668 and WO 97/07669. 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to herein as 
"active compounds") of the invention can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise the 
nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. As 
used herein the language "pharmaceutically acceptable carrier" is intended to include any 
and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic 
and absorption delaying agents, and the like, compatible with pharmaceutical 
administration. The use of such media and agents for pharmaceutically active substances 
is well known in the art. Except insofar as any conventional media or agent is 
incompatible with the active compound, use thereof in the compositions is contemplated. 
Supplementary active compounds can also be incorporated into the compositions. 

The invention includes methods for preparing pharmaceutical compositions for 
modulating the expression or activity of a polypeptide or nucleic acid of the invention. 
Such methods comprise formulating a pharmaceutically acceptable carrier with an agent 
which modulates expression or activity of a polypeptide or nucleic acid of the invention. 
Such compositions can further include additional active agents. Thus, the invention 
further includes methods for preparing a pharmaceutical composition by formulating a 
pharmaceutically acceptable carrier with an agent which modulates expression or activity 
of a polypeptide or nucleic acid of the invention and one or more additional active 
compounds. 

A pharmaceutical composition of the invention is formulated to be compatible with 
its intended route of administration. Examples of routes of administration include 
parenteral (e.g., intravenous, intradermal, subcutaneous), oral, inhalation, transdermal 
(topical), transmucosal, and rectal administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial 
agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers such as 
acetates, citrates or phosphates and agents for the adjustment of tonicity such as sodium 
chloride or dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or 
sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable 
syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ 
(BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In all cases, the composition 
must be sterile and should be fluid to the extent that easy syringability exists. It must be 
stable under the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. The carrier can be a 
solvent or dispersion medium containing, for example, water, ethanol, polyol (for 
example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and 
suitable mixtures thereof. The proper fluidity can be maintained, for example, by the use 
of a coating such as lecithin, by the maintenance of the required particle size in the case of 
dispersion and by the use of surfactants. Prevention of the action of microorganisms can 
be achieved by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars, polyalcohols such as mannitol 
and sorbitol, or sodium chloride in the composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the composition an agent which delays 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g., a polypeptide or antibody) in the required amount in an appropriate solvent with one 
or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound 
into a sterile vehicle which contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are vacuum 
drying and freeze-drying, which yield a powder of the active ingredient plus any 
additional desired ingredient from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can 
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

Pharmaceutically compatible binding agents and/or adjuvant materials can be 
included as part of the composition. The tablets, pills, capsules, troches and the like can 
contain any of the following ingredients, or compounds of a similar nature: a binder such 
as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose; a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from a pressurized container or dispenser which contains a suitable 
propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation 
of such formulations will be apparent to those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be 
prepared according to methods known to those skilled in the art, for example, as described 
in U.S. Patent No. 4,522,811. 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject to 
be treated; each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specifications for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the particular 
therapeutic effect to be achieved, and the limitations inherent in the art of compounding 
such an active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 
(generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50 
mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and 
fully human antibodies have a longer half-life within the human body than other 
antibodies. Accordingly, lower dosages and less frequent administration are often possible. 
Modifications such as lipidation can be used to stabilize antibodies and to enhance uptake 
and tissue penetration (e.g., into the brain). A method for lipidation of antibodies is 
described by Cruikshank et al. ((1997) J. Acquired Immune Deficiency Syndromes and 
Human Retrovirology 14:193). 

As defined herein, a therapeutically effective amount of protein or polypeptide 
(i.e., an effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably 
about 0.01 to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body weight, 
and even more preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, 
or 5 to 6 mg/kg body weight. 
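
The weight-based ranges recited above can be converted into absolute amounts for a given subject. The following is a minimal sketch only, assuming a hypothetical helper `dose_window_mg` and hypothetical tier labels that do not appear in the specification; it illustrates the arithmetic and is not a dosing recommendation.

```python
# Illustrative sketch: the function name and tier labels are hypothetical.
# The numeric ranges (mg per kg body weight) are the ones recited in the text.
RANGES_MG_PER_KG = {
    "broadest": (0.001, 30.0),
    "preferred": (0.01, 25.0),
    "more_preferred": (0.1, 20.0),
    "even_more_preferred": (1.0, 10.0),
}


def dose_window_mg(body_weight_kg, tier="broadest"):
    """Return the (low, high) absolute amount in mg for one subject."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = RANGES_MG_PER_KG[tier]
    # mg/kg multiplied by kg gives mg
    return (low * body_weight_kg, high * body_weight_kg)


if __name__ == "__main__":
    for tier in RANGES_MG_PER_KG:
        print(tier, dose_window_mg(70.0, tier))
```

For a 70 kg subject, the "even more preferred" range of about 1 to 10 mg/kg corresponds to about 70 to 700 mg.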

The skilled artisan will appreciate that certain factors may influence the dosage 
required to effectively treat a subject, including but not limited to the severity of the 
disease or disorder, previous treatments, the general health and/or age of the subject, and 
other diseases present. Moreover, treatment of a subject with a therapeutically effective 
amount of a protein, polypeptide, or antibody can include a single treatment or, preferably, 
can include a series of treatments. In a preferred example, a subject is treated with 
antibody, protein, or polypeptide in the range of between about 0.1 to 20 mg/kg body 
weight, one time per week for between about 1 to 10 weeks, preferably between 2 to 8 
weeks, more preferably between about 3 to 7 weeks, and even more preferably for about 4, 
5, or 6 weeks. It will also be appreciated that the effective dosage of antibody, protein, or 
polypeptide used for treatment may increase or decrease over the course of a particular 
treatment. Changes in dosage may result and become apparent from the results of 
diagnostic assays as described herein. 
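
The once-weekly regimen described above (one treatment per week for between about 1 and 10 weeks) can be laid out as a list of dates. This is a sketch under stated assumptions: the function name `weekly_schedule` and the start date are hypothetical, and the fixed seven-day spacing is simply one reading of "one time per week".

```python
from datetime import date, timedelta


def weekly_schedule(start, weeks):
    """Return one treatment date per week, beginning on `start`.

    `weeks` is limited to the 1-10 week span recited in the text.
    """
    if not 1 <= weeks <= 10:
        raise ValueError("weeks must be between 1 and 10")
    return [start + timedelta(weeks=i) for i in range(weeks)]


if __name__ == "__main__":
    # Hypothetical start date, four weekly treatments
    for day in weekly_schedule(date(2024, 1, 1), 4):
        print(day.isoformat())
```

Because the text notes that the effective dosage may increase or decrease over the course of treatment, a fuller version would pair each date with a dose that is revised as diagnostic results become available.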

The present invention encompasses agents which modulate expression or activity. 
An agent may, for example, be a small molecule. For example, such small molecules 
include, but are not limited to, peptides, peptidomimetics, amino acids, amino acid 
analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, organic 
or inorganic compounds (i.e., including heteroorganic and organometallic compounds) 
having a molecular weight less than about 10,000 grams per mole, organic or inorganic 
compounds having a molecular weight less than about 5,000 grams per mole, organic or 
inorganic compounds having a molecular weight less than about 1,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 500 grams per 
mole, and salts, esters, and other pharmaceutically acceptable forms of such compounds. 

It is understood that appropriate doses of small molecule agents depend upon a 
number of factors within the ken of the ordinarily skilled physician, veterinarian, or 
researcher. The dose(s) of the small molecule will vary, for example, depending upon the 
identity, size, and condition of the subject or sample being treated, further depending upon 
the route by which the composition is to be administered, if applicable, and the effect 
which the practitioner desires the small molecule to have upon the nucleic acid or 
polypeptide of the invention. Exemplary doses include milligram or microgram amounts 
of the small molecule per kilogram of subject or sample weight (e.g., about 1 microgram 
per kilogram to about 500 milligrams per kilogram, about 100 micrograms per kilogram 
to about 5 milligrams per kilogram, or about 1 microgram per kilogram to about 50 
micrograms per kilogram). It is furthermore understood that appropriate doses of a small 
molecule depend upon the potency of the small molecule with respect to the expression or 
activity to be modulated. Such appropriate doses may be determined using the assays 
described herein. When one or more of these small molecules is to be administered to an 
animal (e.g., a human) in order to modulate expression or activity of a polypeptide or 
nucleic acid of the invention, a physician, veterinarian, or researcher may, for example, 
prescribe a relatively low dose at first, subsequently increasing the dose until an 
appropriate response is obtained. In addition, it is understood that the specific dose level 
for any particular animal subject will depend upon a variety of factors including the 
activity of the specific compound employed, the age, body weight, general health, gender, 
and diet of the subject, the time of administration, the route of administration, the rate of 
excretion, any drug combination, and the degree of expression or activity to be modulated. 
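
The exemplary small molecule ranges above mix microgram and milligram units, so comparing them requires a common unit. The sketch below, which assumes hypothetical helper names (`matching_ranges`, `total_dose_mg`) not found in the specification, expresses all three ranges in micrograms per kilogram and reports which ranges contain a given dose.

```python
UG_PER_MG = 1000.0

# The three exemplary ranges from the text, expressed in micrograms per kilogram.
EXEMPLARY_RANGES_UG_PER_KG = [
    (1.0, 500.0 * UG_PER_MG),   # about 1 microgram/kg to about 500 milligrams/kg
    (100.0, 5.0 * UG_PER_MG),   # about 100 micrograms/kg to about 5 milligrams/kg
    (1.0, 50.0),                # about 1 microgram/kg to about 50 micrograms/kg
]


def matching_ranges(dose_ug_per_kg):
    """Return the indices of the exemplary ranges that contain the dose."""
    return [
        index
        for index, (low, high) in enumerate(EXEMPLARY_RANGES_UG_PER_KG)
        if low <= dose_ug_per_kg <= high
    ]


def total_dose_mg(dose_ug_per_kg, weight_kg):
    """Convert a per-kilogram dose in micrograms to a total dose in milligrams."""
    return dose_ug_per_kg * weight_kg / UG_PER_MG


if __name__ == "__main__":
    print(matching_ranges(25.0))
    print(total_dose_mg(100.0, 70.0))
```

A dose of 100 micrograms per kilogram given to a 70 kg subject amounts to 7 mg in total; the appropriate dose within any range remains a matter for the practitioner, as the paragraph above states.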

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (U.S. Patent 5,328,470) or by 
stereotactic injection (see, e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA 
91:3054-3057). The pharmaceutical preparation of the gene therapy vector can include the 
gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in 
which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the 
pharmaceutical preparation can include one or more cells which produce the gene delivery 
system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues, and antibodies 
described herein can be used in one or more of the following methods: a) screening assays; 
b) detection assays (e.g., chromosomal mapping, tissue typing, forensic biology); c) 
predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring clinical trials, 
and pharmacogenomics); and d) methods of treatment (e.g., therapeutic and prophylactic). 
The isolated nucleic acid molecules of the invention can be used to express proteins (e.g., 
via a recombinant expression vector in a host cell in gene therapy applications), to detect 
mRNA (e.g., in a biological sample) or a genetic lesion, and to modulate activity of a 
polypeptide of the invention. In addition, the polypeptides of the invention can be used to 
screen drugs or compounds which modulate activity or expression of a polypeptide of the 
invention as well as to treat disorders characterized by insufficient or excessive production 
of a protein of the invention or production of a form of a protein of the invention which 
has decreased or aberrant activity compared to the wild type protein. In addition, the 
antibodies of the invention can be used to detect and isolate a protein of the invention and modulate 
activity of a protein of the invention. 

This invention further pertains to novel agents identified by the 
above-described screening assays and uses thereof for treatments as described herein. 

A. Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) which bind to a polypeptide of the 
invention or have a stimulatory or inhibitory effect on, for example, expression or activity 
of a polypeptide of the invention. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of a 
polypeptide of the invention or biologically active portion thereof. The test compounds of 
the present invention can be obtained using any of the numerous approaches in 
combinatorial library methods known in the art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the "one-bead one-compound" library method; and synthetic 
library methods using affinity chromatography selection. The biological library approach 
is limited to peptide libraries, while the other four approaches are applicable to peptide, 
non-peptide oligomer or small molecule libraries of compounds (Lam (1997) Anticancer 
Drug Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. USA 90:6909; Erb et al. 
(1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med. Chem. 
37:2678; Cho et al. (1993) Science 261:1303; Carell et al. (1994) Angew. Chem. Int. Ed. 
Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop et al. 
(1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
BioTechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (U.S. Patent No. 5,223,409), spores (Patent Nos. 
5,571,698; 5,403,484; and 5,223,409), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. 
USA 89:1865-1869) or phage (Scott and Smith (1990) Science 249:386-390; Devlin 
(1990) Science 249:404-406; Cwirla et al. (1990) Proc. Natl. Acad. Sci. USA 
87:6378-6382; and Felici (1991) J. Mol. Biol. 222:301-310). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses 
a membrane-bound form of a polypeptide of the invention, or a biologically active portion 
thereof, on the cell surface is contacted with a test compound and the ability of the test 
compound to bind to the polypeptide is determined. The cell, for example, can be a yeast 
cell or a cell of mammalian origin. Determining the ability of the test compound to bind 
to the polypeptide can be accomplished, for example, by coupling the test compound with 
a radioisotope or enzymatic label such that binding of the test compound to the 
polypeptide or biologically active portion thereof can be determined by detecting the 
labeled compound in a complex. For example, test compounds can be labeled with 125I, 
35S, 14C, or 3H, either directly or indirectly, and the radioisotope detected by direct 
counting of radioemission or by scintillation counting. Alternatively, test compounds 
can be enzymatically labeled with, for example, horseradish peroxidase, alkaline 
phosphatase, or luciferase, and the enzymatic label detected by determination of 
conversion of an appropriate substrate to product. In a preferred embodiment, the assay 
comprises contacting a cell which expresses a membrane-bound form of a polypeptide of 
the invention, or a biologically active portion thereof, on the cell surface with a known 
compound which binds the polypeptide to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability of the test compound to interact 
with the polypeptide, wherein determining the ability of the test compound to interact with 
the polypeptide comprises determining the ability of the test compound to preferentially 
bind to the polypeptide or a biologically active portion thereof as compared to the known 
compound. 
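
One common way to quantify whether a test compound binds preferentially as compared to a known compound is to measure how much of a labeled known compound it displaces. The following is a minimal sketch, assuming (the specification does not say this) that the known compound carries the radiolabel and that binding is read out as scintillation counts per minute; the function name `percent_displacement` and the example counts are hypothetical.

```python
def percent_displacement(total_cpm, nonspecific_cpm, with_test_cpm):
    """Percent of specifically bound labeled known compound displaced.

    total_cpm       : counts with labeled known compound alone
    nonspecific_cpm : counts remaining with a large excess of unlabeled competitor
    with_test_cpm   : counts in the presence of the test compound
    """
    specific = total_cpm - nonspecific_cpm
    if specific <= 0:
        raise ValueError("no specific binding window to measure against")
    return 100.0 * (total_cpm - with_test_cpm) / specific


if __name__ == "__main__":
    # Hypothetical counts for illustration only
    print(percent_displacement(10000.0, 1000.0, 5500.0))
```

A value near 0 suggests the test compound does not compete with the known compound under the conditions used, while a value near 100 suggests it displaces essentially all specifically bound known compound.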

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of a polypeptide of the invention, or a biologically 
active portion thereof, on the cell surface with a test compound and determining the ability 
of the test compound to modulate (e.g., stimulate or inhibit) the activity of the polypeptide 
or biologically active portion thereof. Determining the ability of the test compound to 
modulate the activity of the polypeptide or a biologically active portion thereof can be 
accomplished, for example, by determining the ability of the polypeptide to bind to 
or interact with a target molecule. 

Determining the ability of a polypeptide of the invention to bind to or interact with 
a target molecule can be accomplished by one of the methods described above for 
determining direct binding. As used herein, a "target molecule" is a molecule with which 
a selected polypeptide (e.g., a polypeptide of the invention) binds or interacts in 
nature, for example, a molecule on the surface of a cell which expresses the selected 
protein, a molecule on the surface of a second cell, a molecule in the extracellular milieu, a 
molecule associated with the internal surface of a cell membrane or a cytoplasmic 
molecule. A target molecule can be a polypeptide of the invention or some other 
polypeptide or protein. For example, a target molecule can be a component of a signal 
transduction pathway which facilitates transduction of an extracellular signal (e.g., a signal 
generated by binding of a compound to a polypeptide of the invention) through the cell 
membrane and into the cell or a second intercellular protein which has catalytic activity or 
a protein which facilitates the association of downstream signaling molecules with a 
polypeptide of the invention. Determining the ability of a polypeptide of the invention to 
bind to or interact with a target molecule can be accomplished by determining the activity 
of the target molecule. For example, the activity of the target molecule can be determined 
by detecting induction of a cellular second messenger of the target (e.g., intracellular 
Ca2+, diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target on an 
appropriate substrate, detecting the induction of a reporter gene (e.g., a regulatory element 
that is responsive to a polypeptide of the invention operably linked to a nucleic acid 
encoding a detectable marker, e.g., luciferase), or detecting a cellular response, for 
example, cellular differentiation, or cell proliferation. 
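
Where target activity is read out through a reporter gene such as luciferase, the result is often summarized as fold induction over an untreated control. The sketch below is illustrative only: the function name `fold_induction`, the use of relative light units, and the background subtraction step are assumptions and are not prescribed by the specification.

```python
def fold_induction(treated_rlu, control_rlu, background_rlu=0.0):
    """Reporter signal in treated cells relative to untreated control cells.

    All values are relative light units; the background (for example, a
    reading from cells lacking the reporter) is subtracted from both.
    """
    control = control_rlu - background_rlu
    if control <= 0:
        raise ValueError("control signal must exceed background")
    return (treated_rlu - background_rlu) / control


if __name__ == "__main__":
    # Hypothetical readings for illustration only
    print(fold_induction(5100.0, 1100.0, 100.0))
```

A fold induction above 1 is consistent with stimulation of the reporter and a value below 1 with inhibition, subject to the variability of the particular assay.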

In yet another embodiment, an assay of the present invention is a cell-free assay 
comprising contacting a polypeptide of the invention or biologically active portion thereof 
with a test compound and determining the ability of the test compound to bind to the 
polypeptide or biologically active portion thereof. Binding of the test compound to the 
polypeptide can be determined either directly or indirectly as described above. In a 
preferred embodiment, the assay includes contacting the polypeptide of the invention or 
biologically active portion thereof with a known compound which binds the polypeptide 
to form an assay mixture, contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with the polypeptide, wherein 
determining the ability of the test compound to interact with the polypeptide comprises 
determining the ability of the test compound to preferentially bind to the polypeptide or 
biologically active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-free assay comprising contacting a 
polypeptide of the invention or biologically active portion thereof with a test compound 
and determining the ability of the test compound to modulate (e.g., stimulate or inhibit) 
the activity of the polypeptide or biologically active portion thereof. Determining the 
ability of the test compound to modulate the activity of the polypeptide can be 
accomplished, for example, by determining the ability of the polypeptide to bind to a 
target molecule by one of the methods described above for determining direct binding. In 
an alternative embodiment, determining the ability of the test compound to modulate the 
activity of the polypeptide can be accomplished by determining the ability of the 
polypeptide of the invention to further modulate the target molecule. For example, the 
catalytic/enzymatic activity of the target molecule on an appropriate substrate can be 
determined as previously described. 

In yet another embodiment, the cell-free assay comprises contacting a polypeptide 
of the invention or biologically active portion thereof with a known compound which 
binds the polypeptide to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with the 
polypeptide, wherein determining the ability of the test compound to interact with the 
polypeptide comprises determining the ability of the polypeptide to preferentially bind to 
or modulate the activity of a target molecule. 

The cell-free assays of the present invention are amenable to use of both the soluble 
form and the membrane-bound form of a polypeptide of the invention. In the case of 
cell-free assays comprising the membrane-bound form of the polypeptide, it may be 
desirable to utilize a solubilizing agent such that the membrane-bound form of the 
polypeptide is maintained in solution. Examples of such solubilizing agents include 
non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, n-octylmaltoside, 
octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton X-100, Triton X-114, 
Thesit, Isotridecypoly(ethylene glycol ether)n, 
3-[(3-cholamidopropyl)dimethylamminio]-1-propane sulfonate (CHAPS), 
3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-1-propane sulfonate (CHAPSO), or 
N-dodecyl-N,N-dimethyl-3-ammonio-1-propane sulfonate. 

In more than one embodiment of the above assay methods of the present invention, 
it may be desirable to immobilize either the polypeptide of the invention or its target 
molecule to facilitate separation of complexed from uncomplexed forms of one or both of 
the proteins, as well as to accommodate automation of the assay. Binding of a test 
compound to the polypeptide, or interaction of the polypeptide with a target molecule in 
the presence and absence of a candidate compound, can be accomplished in any vessel 
suitable for containing the reactants. Examples of such vessels include microtitre plates, 
test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be 
provided which adds a domain that allows one or both of the proteins to be bound to a 
matrix. For example, glutathione-S-transferase fusion proteins can be adsorbed onto
glutathione sepharose beads (Sigma Chemical; St. Louis, MO) or glutathione derivatized
microtitre plates, which are then combined with the test compound or the test compound
and either the non-adsorbed target protein or a polypeptide of the invention, and the
mixture incubated
under conditions conducive to complex formation (e.g., at physiological conditions for salt 
and pH). Following incubation, the beads or microtitre plate wells are washed to remove 
any unbound components and complex formation is measured either directly or indirectly, 
for example, as described above. Alternatively, the complexes can be dissociated from the 
matrix, and the level of binding or activity of the polypeptide of the invention can be 
determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the polypeptide of the invention or 
its target molecule can be immobilized utilizing conjugation of biotin and streptavidin. 
Biotinylated polypeptide of the invention or target molecules can be prepared from 
biotin-NHS (N-hydroxy-succinimide) using techniques well known in the art (e.g., 
biotinylation kit, Pierce Chemical; Rockford, IL), and immobilized in the wells of
streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive 
with the polypeptide of the invention or target molecules but which do not interfere with 
binding of the polypeptide of the invention to its target molecule can be derivatized to the 
wells of the plate, and unbound target or polypeptide of the invention trapped in the wells 
by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the polypeptide of the invention or target
molecule, as well as enzyme-linked assays which rely on detecting an enzymatic activity
associated with the polypeptide of the invention or target molecule. 

In another embodiment, modulators of expression of a polypeptide of the invention 
are identified in a method in which a cell is contacted with a candidate compound and the
expression of the selected mRNA or protein (i.e., the mRNA or protein corresponding to a
polypeptide or nucleic acid of the invention) in the cell is determined. The level of
expression of the selected mRNA or protein in the presence of the candidate compound is
compared to the level of expression of the selected mRNA or protein in the absence of the
candidate compound. The candidate compound can then be identified as a modulator of
expression of the polypeptide of the invention based on this comparison. For example,
when expression of the selected mRNA or protein is greater (statistically significantly
greater) in the presence of the candidate compound than in its absence, the candidate
compound is identified as a stimulator of the selected mRNA or protein expression.
Alternatively, when expression of the selected mRNA or protein is less (statistically
significantly less) in the presence of the candidate compound than in its absence, the
candidate compound is identified as an inhibitor of the selected mRNA or protein 
expression. The level of the selected mRNA or protein expression in the cells can be 
determined by methods described herein. 
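
The comparison described in the preceding paragraph can be sketched in code. The following is a minimal illustration only, assuming replicate expression measurements are available for treated and untreated cells. The function names, the 0.05 threshold, and the choice of an exact permutation test are assumptions made for this sketch; the specification does not name a particular statistical test.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, untreated):
    """Exact two-sided permutation test on the difference in group means.

    Hypothetical helper: the statistical test is an assumption of this sketch.
    """
    pooled = list(treated) + list(untreated)
    n_treated = len(treated)
    observed = abs(mean(treated) - mean(untreated))
    extreme = 0
    total = 0
    # Enumerate every way of relabeling the pooled measurements.
    for idx in combinations(range(len(pooled)), n_treated):
        chosen = set(idx)
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating-point ties.
        if abs(mean(group_a) - mean(group_b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def classify_modulator(treated, untreated, alpha=0.05):
    """Label a candidate compound from expression with and without it."""
    p_value = permutation_p_value(treated, untreated)
    if p_value >= alpha:
        return "no significant effect"
    if mean(treated) > mean(untreated):
        return "stimulator"
    return "inhibitor"
```

With four replicates per group, the smallest achievable two-sided p-value is 2/70, so at least four replicates per condition are needed for this particular test to reach the 0.05 threshold.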

In yet another aspect of the invention, the polypeptides of the invention can be used
as "bait proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No.
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem.
268:12046-12054; Bartel et al. (1993) Bio/Techniques 14:920-924; Iwabuchi et al. (1993)
Oncogene 8:1693-1696; and PCT Publication No. WO 94/10300), to identify other
proteins which bind to or interact with the polypeptide of the invention and modulate
activity of the polypeptide of the invention. Such binding proteins are also likely to be
involved in the propagation of signals by the polypeptide of the invention as, for
example, upstream or downstream elements of a signaling pathway involving the
polypeptide of the invention.

This invention further pertains to novel agents identified by the above-described
screening assays and uses thereof for treatments as described herein.



B. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as polynucleotide 
reagents. For example, these sequences can be used to: (i) map their respective genes on a
chromosome and, thus, locate gene regions associated with genetic disease; (ii) identify an
individual from a minute biological sample (tissue typing); and (iii) aid in forensic 
identification of a biological sample. These applications are described in the subsections 
below. 

1. Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. Accordingly, 
nucleic acid molecules described herein, or fragments thereof, can be used to map the
location of the corresponding genes on a chromosome. The mapping of the sequences to 
chromosomes is an important first step in correlating these sequences with genes 
associated with disease. 

Briefly, genes can be mapped to chromosomes by preparing PCR primers
(preferably 15-25 bp in length) from the sequence of a gene of the invention. Computer
analysis of the sequence of a gene of the invention can be used to rapidly select primers
that do not span more than one exon in the genomic DNA, since primers spanning more
than one exon would complicate the amplification process. These primers can then be
used for PCR screening of somatic cell hybrids containing individual human chromosomes.
Only those hybrids containing the human gene corresponding to the gene sequences will
yield an amplified fragment. For a review of this technique, see D'Eustachio et al.
((1983) Science 220:919-924).
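
The logic of assigning a gene to a chromosome from a hybrid panel can be illustrated with a short sketch. It assumes a panel in which the human chromosomes retained by each hybrid are known, and reports the chromosomes whose presence matches the PCR results in every hybrid. The hybrid names, panel contents, and function name below are hypothetical and are not taken from the specification.

```python
def assign_chromosome(panel, pcr_positive):
    """Return chromosomes fully concordant with the amplification results.

    panel: maps hybrid name -> set of human chromosomes retained (assumed known).
    pcr_positive: maps hybrid name -> True if an amplified fragment was obtained.
    """
    candidates = set().union(*panel.values())
    concordant = []
    for chrom in sorted(candidates):
        # A chromosome is concordant if it is present exactly in the
        # hybrids that yielded an amplified fragment.
        if all((chrom in panel[h]) == pcr_positive[h] for h in panel):
            concordant.append(chrom)
    return concordant


# Hypothetical four-hybrid panel, for illustration only.
panel = {
    "H1": {"1", "11"},
    "H2": {"11", "X"},
    "H3": {"1", "X"},
    "H4": {"5"},
}
pcr_positive = {"H1": True, "H2": True, "H3": False, "H4": False}
result = assign_chromosome(panel, pcr_positive)
```

In this invented panel only chromosome 11 is present in both positive hybrids and absent from both negative ones, so `result` is `["11"]`. A real panel would need enough hybrids to distinguish all chromosomes from one another.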

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular sequence to a particular chromosome. Three or more sequences can be assigned 
per day using a single thermal cycler. Using the nucleic acid sequences of the invention to 
design oligonucleotide primers, sublocalization can be achieved with panels of fragments 
from specific chromosomes. Other mapping strategies which can similarly be used to map 
a gene to its chromosome include in situ hybridization (described in Fan et al. (1990) 
Proc. Natl. Acad. Sci. USA 87:6223-27), pre-screening with labeled flow-sorted
chromosomes (CITE), and pre-selection by hybridization to chromosome specific cDNA
libraries. Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase
chromosomal spread can further be used to provide a precise chromosomal location in one 
step. For a review of this technique, see Verma et al., (Human Chromosomes: A Manual 
of Basic Techniques (Pergamon Press, New York, 1988)). 

Reagents for chromosome mapping can be used individually to mark a single
chromosome or a single site on that chromosome, or panels of reagents can be used for
marking multiple sites and/or multiple chromosomes. Reagents corresponding to
noncoding regions of the genes actually are preferred for mapping purposes. Coding
sequences are more likely to be conserved within gene families, thus increasing the chance
of cross hybridizations during chromosomal mapping.

Once a sequence has been mapped to a precise chromosomal location, the physical 
position of the sequence on the chromosome can be correlated with genetic map data.
(Such data are found, for example, in V. McKusick, Mendelian Inheritance in Man,
available on-line through Johns Hopkins University Welch Medical Library.) The
relationship between genes and disease, mapped to the same chromosomal region, can
then be identified through linkage analysis (co-inheritance of physically adjacent genes),
described in, e.g., Egeland et al. (1987) Nature 325:783-787.

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with a gene of the invention can be determined. If a 
mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease. 
Comparison of affected and unaffected individuals generally involves first looking for 
structural alterations in the chromosomes such as deletions or translocations that are 
visible from chromosome spreads or detectable using PCR based on that DNA sequence.
Ultimately, complete sequencing of genes from several individuals can be performed to 
confirm the presence of a mutation and to distinguish mutations from polymorphisms. 

Furthermore, the nucleic acid sequences disclosed herein can be used to perform 
searches against "mapping databases", e.g., BLAST-type search, such that the 
chromosome position of the gene is identified by sequence homology or identity with 
known sequence fragments which have been mapped to chromosomes. 

In the instant case, the human gene for INTERCEPT 258 has been mapped to the
long arm of chromosome 11, in the region q23. Flanking markers for this region are
D11S936 and D11S933. The CMT4B (Charcot Marie Tooth neuropathy), ED4
(ectodermal dysplasia), JBS (Jacobsen Syndrome), and TCPT (thrombocytopenia) loci also
map to this region of the human chromosome. The APOLP1 (apolipoprotein cluster),
DRD2 (dopamine receptor), and RDX (radixin) genes also map to this region of the
human chromosome. This region is syntenic to mouse chromosome 9. The atm (ataxia
telangiectasia), ruf (rough fur), and vs (variable spotting) loci map to this region of the
mouse chromosome. The lu (luxoid), vs (variable spotting), atm (ataxia telangiectasia),
ruf (rough fur), and lap1 (leucine arylaminopeptidase) genes also map to this region of the
mouse chromosome.

A polypeptide and fragments and sequences thereof and antibodies specific thereto
can be used to map the location of the gene encoding the polypeptide on a chromosome. 
This mapping can be carried out by specifically detecting the presence of the polypeptide 
in members of a panel of somatic cell hybrids between cells of a first species of animal
from which the protein originates and cells from a second species of animal and then
determining which somatic cell hybrid(s) expresses the polypeptide and noting the
chromosome(s) from the first species of animal that it contains. For examples of this
technique, see Pajunen et al. (1988) Cytogenet. Cell Genet. 47:31-41 and Van Keuren et
al. (1986) Hum. Genet. 74:34-40. Alternatively, the presence of the polypeptide in the
somatic cell hybrids can be determined by assaying an activity or property of the
polypeptide, for example, enzymatic activity, as described in Bordelon-Riser et al. (1979)
Somatic Cell Genetics 5:597-613 and Owerbach et al. (1978) Proc. Natl. Acad. Sci. USA
75:5640-5644.



2. Tissue Typing 

The nucleic acid sequences of the present invention can also be used to identify 
individuals from minute biological samples. The United States military, for example, is 
considering the use of restriction fragment length polymorphism (RFLP) for identification
of its personnel. In this technique, an individual's genomic DNA is digested with one or
more restriction enzymes, and probed on a Southern blot to yield unique bands for
identification. This method does not suffer from the current limitations of "Dog Tags"
which can be lost, switched, or stolen, making positive identification difficult. The
sequences of the present invention are useful as additional DNA markers for RFLP
(described in U.S. Patent 5,272,057).

Furthermore, the sequences of the present invention can be used to provide an 
alternative technique which determines the actual base-by-base DNA sequence of selected 
portions of an individual's genome. Thus, the nucleic acid sequences described herein can 
be used to prepare two PCR primers from the 5' and 3' ends of the sequences. These 
primers can then be used to amplify an individual's DNA and subsequently sequence it.

Panels of corresponding DNA sequences from individuals, prepared in this 
manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the present 
invention can be used to obtain such identification sequences from individuals and from
tissue. The nucleic acid sequences of the invention uniquely represent portions of the
human genome. Allelic variation occurs to some degree in the coding regions of these
sequences, and to a greater degree in the noncoding regions. It is estimated that allelic
variation between individual humans occurs with a frequency of about once per 500
bases. Each of the sequences described herein can, to some degree, be used as a standard
against which DNA from an individual can be compared for identification purposes.

Because greater numbers of polymorphisms occur in the noncoding regions, fewer
sequences are necessary to differentiate individuals. The noncoding sequences of SEQ ID
NO:1, 8, 15, 21, 26, 37, 46 or 56 can comfortably provide positive individual
identification with a panel of perhaps 10 to 1,000 primers which each yield a noncoding
amplified sequence of 100 bases. If predicted coding sequences, such as those in SEQ ID
NO:2, 9, 16, 22, 27, 38, 47 or 57 are used, a more appropriate number of primers for
positive individual identification would be 500-2,000.
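
As a rough illustration of the arithmetic behind these panel sizes, the sketch below computes the expected number of variant positions sampled by a panel. It assumes the estimate given above of about one allelic difference per 500 bases, applied uniformly and independently across 100-base amplified sequences. That uniformity is a simplifying assumption of this sketch; the text itself notes that noncoding regions vary more than coding regions. The function name is hypothetical.

```python
def expected_variant_sites(n_amplicons, amplicon_length=100, bases_per_variant=500):
    """Expected number of variant positions covered by a primer panel.

    Assumes a uniform rate of one variant per `bases_per_variant` bases.
    """
    return n_amplicons * amplicon_length / bases_per_variant


# Panel sizes at the two ends of the range mentioned for noncoding sequences.
small_panel = expected_variant_sites(10)
large_panel = expected_variant_sites(1000)
```

Under these assumptions a 10-primer panel samples about 2 variant positions on average and a 1,000-primer panel about 200, which shows why larger panels give more discriminating profiles.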

If a panel of reagents from the nucleic acid sequences described herein is used to 
generate a unique identification database for an individual, those same reagents can later 
be used to identify tissue from that individual. Using the unique identification database, 
positive identification of the individual, living or dead, can be made from extremely small
tissue samples. 



3. Use of Partial Gene Sequences in Forensic Biology 

DNA-based identification techniques can also be used in forensic biology.
Forensic biology is a scientific field employing genetic typing of biological evidence
found at a crime scene as a means for positively identifying, for example, a perpetrator of
a crime. To make such an identification, PCR technology can be used to amplify DNA
sequences taken from very small biological samples such as tissues, e.g., hair or skin, or
body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified sequence
can then be compared to a standard, thereby allowing identification of the origin of the
biological sample.

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can
enhance the reliability of DNA-based forensic identifications by, for example, providing
another "identification marker" (i.e., another DNA sequence that is unique to a particular
individual). As mentioned above, actual base sequence information can be used for
identification as an accurate alternative to patterns formed by restriction enzyme generated
fragments. Sequences targeted to noncoding regions are particularly appropriate for this
use as greater numbers of polymorphisms occur in the noncoding regions, making it easier
to differentiate individuals using this technique. Examples of polynucleotide reagents
include the nucleic acid sequences of the invention or portions thereof, e.g., fragments
derived from noncoding regions having a length of at least 20 or 30 bases. 

The nucleic acid sequences described herein can further be used to provide 
polynucleotide reagents, e.g., labeled or labelable probes which can be used in, for 
example, an in situ hybridization technique, to identify a specific tissue, e.g., brain tissue.
This can be very useful in cases where a forensic pathologist is presented with a tissue of 
unknown origin. Panels of such probes can be used to identify tissue by species and/or by 
organ type. 

C. Predictive Medicine 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trials are used for prognostic 
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one
aspect of the present invention relates to diagnostic assays for determining TANGO 253,
TANGO 257, INTERCEPT 258, or TANGO 281 protein and/or nucleic acid expression as
well as TANGO 253, TANGO 257, INTERCEPT 258, or TANGO 281 activity, in the
context of a biological sample (e.g., blood, serum, cells, tissue) to thereby determine
whether an individual is afflicted with a disease or disorder, or is at risk of developing a
disorder, associated with aberrant or unwanted TANGO 253, TANGO 257, INTERCEPT
258, or TANGO 281 expression or activity. The invention also provides for prognostic (or
predictive) assays for determining whether an individual is at risk of developing a disorder
associated with TANGO 253, TANGO 257, INTERCEPT 258, or TANGO 281 protein,
nucleic acid expression or activity. For example, mutations in a TANGO 253, TANGO
257, INTERCEPT 258, or TANGO 281 gene can be assayed in a biological sample. Such
assays can be used for prognostic or predictive purposes to thereby prophylactically treat an
individual prior to the onset of a disorder characterized by or associated with TANGO
253, TANGO 257, INTERCEPT 258, or TANGO 281 protein, nucleic acid expression or
activity.

As an alternative to making determinations based on the absolute expression level 
of selected genes, determinations may be based on the normalized expression levels of 
these genes. Expression levels are normalized by correcting the absolute expression level 
of a TANGO 253, TANGO 257, INTERCEPT 258, or TANGO 281 gene by comparing its 
expression to the expression of a gene that is not a TANGO 253, TANGO 257, 
INTERCEPT 258, or TANGO 281 gene, e.g., a housekeeping gene that is constitutively 
expressed. Suitable genes for normalization include housekeeping genes such as the actin
gene. This normalization allows the comparison of the expression level in one sample,
e.g., a patient sample, to another sample, e.g., a sample from an individual without a 
particular disease or disorder, or a sample from a healthy individual, or between samples 
from different sources. 
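
A minimal sketch of such a normalization follows. It assumes that the correction is made by taking the ratio of the target gene signal to the housekeeping gene signal measured in the same sample, which is one common way to do it but is not prescribed by the text. The function name and the numeric values are hypothetical.

```python
def normalize_expression(target_level, housekeeping_level):
    """Express a target gene's level relative to a housekeeping gene.

    Assumes a simple ratio; both levels come from the same sample.
    """
    if housekeeping_level <= 0:
        raise ValueError("housekeeping gene level must be positive")
    return target_level / housekeeping_level


# Hypothetical raw signals in arbitrary units (target gene, actin).
patient = normalize_expression(target_level=840.0, housekeeping_level=1200.0)
control = normalize_expression(target_level=300.0, housekeeping_level=1000.0)

# Normalized values can be compared across samples despite differing input amounts.
fold_difference = patient / control
```

Here the patient sample has a normalized level of 0.7 and the control 0.3, so the target gene appears roughly 2.3-fold higher in the patient sample once loading differences are accounted for.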

Alternatively, the expression level can be provided as a relative expression level.
To determine a relative expression level of a gene, the level of expression of the gene is
determined for 10 or more samples of different cell isolates (e.g., neural cell isolates, glial
cell isolates, immune cell isolates, platelet isolates, megakaryocyte isolates, endothelial
cell isolates, and osteocyte isolates), preferably 50 or more samples, prior to the
determination of the expression level for the sample in question. The mean expression
level of each of the genes assayed in the larger number of samples is determined and this
is used as a baseline expression level for the gene(s) in question. The expression level of
the gene determined for the test sample (absolute level of expression) is then divided by
the mean expression value obtained for that gene. This provides a relative expression
level and aids in identifying extreme cases of diseases and disorders such as obesity,
coronary disorders (e.g., atherosclerosis), neuronal disorders, pulmonary disorders, renal
disorders, and bleeding disorders.
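
The relative expression calculation described above amounts to dividing the test sample's level by the mean of the baseline samples. The sketch below shows this under the stated minimum of 10 baseline samples. The function name, the constant name, and the example values are hypothetical.

```python
from statistics import mean

# The text calls for 10 or more baseline samples, preferably 50 or more.
MIN_BASELINE_SAMPLES = 10


def relative_expression(test_level, baseline_levels):
    """Divide a test sample's expression level by the baseline mean."""
    if len(baseline_levels) < MIN_BASELINE_SAMPLES:
        raise ValueError("at least 10 baseline samples are required")
    baseline_mean = mean(baseline_levels)
    if baseline_mean <= 0:
        raise ValueError("baseline mean must be positive")
    return test_level / baseline_mean


# Hypothetical baseline of ten cell isolates (arbitrary units).
baseline = [10.0] * 5 + [30.0] * 5
relative = relative_expression(40.0, baseline)
```

With a baseline mean of 20.0, a test level of 40.0 gives a relative expression level of 2.0. As more baseline data accumulate, the mean can simply be recomputed over the larger set.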

Preferably, the samples used in the baseline determination will be from diseased or 
from non-diseased cells of the appropriate cell type or tissue. The choice of the cell source 
is dependent on the use of the relative expression level. Using expression found in normal 
tissues as a mean expression score aids in validating whether the TANGO 253, TANGO 
257, INTERCEPT 258, or TANGO 281 gene assayed is specific (versus normal cells). 
Such a use is particularly important in identifying whether a TANGO 253, TANGO 257, 
INTERCEPT 258, or TANGO 281 gene can serve as a target gene. In addition, as more
data is accumulated, the mean expression value can be revised, providing improved 
relative expression values based on accumulated data. Expression data from cells provides 
a means for grading the severity of the disease or disorder state. 

Another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of TANGO 253, TANGO 257,
INTERCEPT 258, or TANGO 281 in clinical trials. These and other agents are described
in further detail in the following sections. 

1. Diagnostic Assays 

An exemplary method for detecting the presence or absence of a polypeptide or 
nucleic acid of the invention in a biological sample involves obtaining a biological sample
from a test subject and contacting the biological sample with a compound or an agent
capable of detecting a polypeptide or nucleic acid (e.g., mRNA, genomic DNA) of the 
invention such that the presence of a polypeptide or nucleic acid of the invention is 
detected in the biological sample. A preferred agent for detecting mRNA or genomic
DNA encoding a polypeptide of the invention is a labeled nucleic acid probe capable of
hybridizing to mRNA or genomic DNA encoding a polypeptide of the invention. The
nucleic acid probe can be, for example, a full-length cDNA, such as the nucleic acid of
SEQ ID NO:1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56 or 57, or a portion thereof,
such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and
sufficient to specifically hybridize under stringent conditions to a mRNA or genomic
DNA encoding a polypeptide of the invention. Other suitable probes for use in the 
diagnostic assays of the invention are described herein. 

A preferred agent for detecting a polypeptide of the invention is an antibody 
capable of binding to a polypeptide of the invention, preferably an antibody with a
detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An intact
antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term "labeled",
with regard to the probe or antibody, is intended to encompass direct labeling of the probe
or antibody by coupling (i.e., physically linking) a detectable substance to the probe or
antibody, as well as indirect labeling of the probe or antibody by reactivity with another
reagent that is directly labeled. Examples of indirect labeling include detection of a
primary antibody using a fluorescently labeled secondary antibody and end-labeling of a
DNA probe with biotin such that it can be detected with fluorescently labeled streptavidin.
The term "biological sample" is intended to include tissues, cells and biological fluids
isolated from a subject, as well as tissues, cells and fluids present within a subject. That
is, the detection method of the invention can be used to detect mRNA, protein, or genomic
DNA in a biological sample in vitro as well as in vivo. For example, in vitro techniques
for detection of mRNA include Northern hybridizations and in situ hybridizations. In
vitro techniques for detection of a polypeptide of the invention include enzyme linked
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and
immunofluorescence. In vitro techniques for detection of genomic DNA include Southern
hybridizations. Furthermore, in vivo techniques for detection of a polypeptide of the 
invention include introducing into a subject a labeled antibody directed against the 
polypeptide. For example, the antibody can be labeled with a radioactive marker whose 
presence and location in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test
subject or genomic DNA molecules from the test subject. A preferred biological sample is
a peripheral blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent
capable of detecting a polypeptide of the invention or mRNA or genomic DNA encoding a
polypeptide of the invention, such that the presence of the polypeptide or mRNA or
genomic DNA encoding the polypeptide is detected in the biological sample, and
comparing the presence of the polypeptide or mRNA or genomic DNA encoding the
polypeptide in the control sample with the presence of the polypeptide or mRNA or
genomic DNA encoding the polypeptide in the test sample.

The invention also encompasses kits for detecting the presence of a polypeptide or 
nucleic acid of the invention in a biological sample (a test sample). Such kits can be used 
to determine if a subject is suffering from or is at increased risk of developing a disorder 
associated with aberrant expression of a polypeptide of the invention, as discussed, for 
example, in sections above relating to uses of the sequences of the invention. 

For example, kits can be used to determine if a subject is suffering from or is at 
increased risk of disorders such as coronary disorders (e.g., heart diseases and disorders 
such as atherosclerosis, coronary artery disease and plaque formation), and
adipocyte-related disorders (e.g., obesity), which are associated with aberrant TANGO 253
expression. In another example, kits can be used to determine if a subject is suffering
from or is at increased risk of disorders such as coronary disorders (e.g., heart diseases and
disorders such as atherosclerosis, coronary artery disease and plaque formation), olfactory
disorders, neurological disorders (e.g., neurodegenerative disorders, neuromuscular
disorders, cognitive disorders, personality disorders, and motor disorders) and pulmonary
disorders (e.g., cystic fibrosis), which are associated with aberrant TANGO 257
expression. In another example, kits can be used to determine if a subject is suffering from
or is at increased risk of disorders such as Type I immunologic disorders (e.g.,
anaphylaxis and rhinitis), which are associated with aberrant INTERCEPT 258 expression.
In another example, kits can be used to determine if a subject is suffering from or is at
increased risk of disorders such as immunological disorders (e.g., thrombocytopenia and
platelet disorders), developmental disorders, coronary disorders (e.g., ischemic heart
disease or atherosclerosis), neurological disorders (e.g., head trauma and brain cancer),
pulmonary disorders (e.g., lung cancer, cystic fibrosis and rheumatoid lung disease),
kidney disorders (e.g., glomerulonephritis and end stage renal disease), autoimmune
disorders (e.g., Crohn's disease) and embryonic disorders, which are associated with
aberrant TANGO 281 expression. The kit, for example, can comprise a labeled compound
or agent capable of detecting the polypeptide or mRNA encoding the polypeptide in a
biological sample and means for determining the amount of the polypeptide or mRNA in
the sample (e.g., an antibody which binds the polypeptide or an oligonucleotide probe
which binds to DNA or mRNA encoding the polypeptide). Kits can also include
instructions for observing that the tested subject is suffering from or is at risk of
developing a disorder associated with aberrant expression of the polypeptide if the amount 
of the polypeptide or mRNA encoding the polypeptide is above or below a normal level. 

For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide of the invention; and,
optionally, (2) a second, different antibody which binds to either the polypeptide or the
first antibody and is conjugated to a detectable agent. 

For oligonucleotide-based kits, the kit can comprise, for example: (1) an 
oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic
acid sequence encoding a polypeptide of the invention or (2) a pair of primers useful for
amplifying a nucleic acid molecule encoding a polypeptide of the invention. The kit can
also comprise, e.g., a buffering agent, a preservative, or a protein stabilizing agent. The
kit can also comprise components necessary for detecting the detectable agent (e.g., an
enzyme or a substrate). The kit can also contain a control sample or a series of control
samples which can be assayed and compared to the test sample contained. Each
component of the kit is usually enclosed within an individual container and all of the 
various containers are within a single package along with instructions for observing 
whether the tested subject is suffering from or is at risk of developing a disorder 
associated with aberrant expression of the polypeptide. 

2. Prognostic Assays 

The methods described herein can furthermore be utilized as diagnostic or 
prognostic assays to identify subjects having or at risk of developing a disease or disorder 
associated with aberrant expression or activity of a polypeptide of the invention. For
example, the assays described herein, such as the preceding diagnostic assays or the
following assays, can be utilized to identify a subject having or at risk of developing a
disorder associated with aberrant expression or activity of a polypeptide of the invention,
e.g., coronary disorders, pulmonary disorders, kidney disorders or embryonic disorders.
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk
for developing such a disease or disorder. Thus, the present invention provides a method
in which a test sample is obtained from a subject and a polypeptide or nucleic acid (e.g.,
mRNA, genomic DNA) of the invention is detected, wherein the presence of the
polypeptide or nucleic acid is diagnostic for a subject having or at risk of developing a 
disease or disorder associated with aberrant expression or activity of the polypeptide. As 
used herein, a "test sample" refers to a biological sample obtained from a subject of 
interest. For example, a test sample can be a biological fluid (e.g., serum), cell sample, or 
tissue. 

The prognostic assays described herein, for example, can be used to identify a 
subject having or at risk of developing disorders such as disorders discussed, for example, 
in Sections above relating to uses of the sequences of the invention. 

For example, such disorders can include coronary disorders (e.g., heart diseases
and disorders such as atherosclerosis, coronary artery disease and plaque formation) and
adipocyte disorders (e.g., obesity), which are associated with aberrant TANGO 253
expression. In another example, the prognostic assays described herein can be used to
identify a subject having or at risk of developing disorders such as coronary disorders
(e.g., heart diseases and disorders such as atherosclerosis, coronary artery disease and
plaque formation), olfactory disorders, neurological disorders (e.g., neurodegenerative
disorders, neuromuscular disorders, cognitive disorders, personality disorders, and motor
disorders), and pulmonary disorders (e.g., cystic fibrosis), which are associated with
aberrant TANGO 257 expression. In another example, the prognostic assays described herein
can be used to identify a subject having or at risk of developing disorders such as Type I
immunologic disorders (e.g., anaphylaxis and rhinitis), which are associated with aberrant
INTERCEPT 258 expression. In another example, the prognostic assays described herein
can be used to identify a subject having or at risk of developing disorders such as
immunological disorders (e.g., thrombocytopenia and platelet disorders), developmental
disorders, coronary disorders (e.g., ischemic heart disease and atherosclerosis),
neurological disorders (e.g., head trauma and brain cancer), pulmonary disorders (e.g.,
lung cancer, cystic fibrosis and rheumatoid lung disease), kidney disorders (e.g.,
glomerulonephritis and end stage renal disease), autoimmune disorders (e.g., Crohn's
disease) and embryonic disorders, which are associated with aberrant TANGO 281
expression.



Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) to 
treat a disease or disorder associated with aberrant expression or activity of a polypeptide
of the invention. For example, such methods can be used to determine whether a subject
can be effectively treated with a specific agent or class of agents (e.g., agents of a type
which decrease activity of the polypeptide). Thus, the present invention provides methods
for determining whether a subject can be effectively treated with an agent for a disorder
associated with aberrant expression or activity of a polypeptide of the invention in which a
test sample is obtained and the polypeptide or nucleic acid encoding the polypeptide is
detected (e.g., wherein the presence of the polypeptide or nucleic acid is diagnostic for a
subject that can be administered the agent to treat a disorder associated with aberrant 
expression or activity of the polypeptide). 

The methods of the invention can also be used to detect genetic lesions or
mutations in a gene of the invention, thereby determining if a subject with the lesioned
gene is at risk for a disorder characterized by aberrant expression or activity of a polypeptide
of the invention. In preferred embodiments, the methods include detecting, in a sample of
cells from the subject, the presence or absence of a genetic lesion or mutation
characterized by at least one of an alteration affecting the integrity of a gene encoding the
polypeptide of the invention, or the mis-expression of the gene encoding the polypeptide
of the invention. For example, such genetic lesions or mutations can be detected by
ascertaining the existence of at least one of: 1) a deletion of one or more nucleotides from
the gene; 2) an addition of one or more nucleotides to the gene; 3) a substitution of one or
more nucleotides of the gene; 4) a chromosomal rearrangement of the gene; 5) an
alteration in the level of a messenger RNA transcript of the gene; 6) an aberrant
modification of the gene, such as of the methylation pattern of the genomic DNA; 7) the
presence of a non-wild type splicing pattern of a messenger RNA transcript of the gene; 8)
a non-wild type level of the protein encoded by the gene; 9) an allelic loss of the gene;
and 10) an inappropriate post-translational modification of the protein encoded by the
gene. As described herein, there are a large number of assay techniques known in the art
which can be used for detecting lesions in a gene.

In certain embodiments, detection of the lesion involves the use of a probe/primer
in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and
4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain
reaction (LCR) (see, e.g., Landegren et al. (1988) Science 241:1077-1080; and Nakazawa
et al. (1994) Proc. Natl. Acad. Sci. USA 91:360-364), the latter of which can be
particularly useful for detecting point mutations in a gene (see, e.g., Abravaya et al. (1995)
Nucleic Acids Res. 23:675-682). This method can include the steps of collecting a sample
of cells from a patient, isolating nucleic acid (e.g., genomic, mRNA or both) from the cells
of the sample, contacting the nucleic acid sample with one or more primers which
specifically hybridize to the selected gene under conditions such that hybridization and
amplification of the gene (if present) occurs, and detecting the presence or absence of an
amplification product, or detecting the size of the amplification product and comparing the
length to a control sample. It is anticipated that PCR and/or LCR may be desirable to use
as a preliminary amplification step in conjunction with any of the techniques used for
detecting mutations described herein.

Alternative amplification methods include: self sustained sequence replication 
(Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173-1177),
Q-Beta Replicase (Lizardi et al. (1988) Bio/Technology 6:1197), or any other nucleic acid
amplification method, followed by the detection of the amplified molecules using 
techniques well known to those of skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such molecules are present in very low 
numbers. 

In an alternative embodiment, mutations in a selected gene from a sample cell can
be identified by alterations in restriction enzyme cleavage patterns. For example, sample
and control DNA are isolated, amplified (optionally), and digested with one or more restriction
endonucleases, and fragment lengths are determined by gel electrophoresis and
compared. Differences in fragment lengths between sample and control DNA
indicate mutations in the sample DNA. Moreover, sequence specific
ribozymes (see, e.g., U.S. Patent No. 5,498,531) can be used to score for the presence of
specific mutations by development or loss of a ribozyme cleavage site.
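
The restriction-pattern comparison described above can be illustrated computationally. The following is a minimal sketch only, assuming linear DNA, a single enzyme, and complete digestion; the function names and the example sequences are hypothetical and are not sequences of the invention. EcoRI (recognition site GAATTC, cutting after the first G) is used purely as a familiar example.

```python
def digest(sequence: str, site: str, cut_offset: int) -> list[int]:
    """Return sorted fragment lengths for a linear sequence cut at every
    occurrence of `site`; `cut_offset` is the cut position within the site."""
    cuts = []
    start = 0
    while True:
        idx = sequence.find(site, start)
        if idx == -1:
            break
        cuts.append(idx + cut_offset)
        start = idx + 1
    bounds = [0] + cuts + [len(sequence)]
    return sorted(b - a for a, b in zip(bounds, bounds[1:]) if b > a)


def patterns_differ(sample: str, control: str, site: str, cut_offset: int) -> bool:
    """True if sample and control give different fragment-length patterns."""
    return digest(sample, site, cut_offset) != digest(control, site, cut_offset)


# Hypothetical control with one EcoRI site, and a sample in which a single
# base substitution destroys that site.
control_dna = "AAAAGAATTCAAAAAA"
sample_dna = "AAAAGAGTTCAAAAAA"
print(digest(control_dna, "GAATTC", 1))
print(digest(sample_dna, "GAATTC", 1))
print(patterns_differ(sample_dna, control_dna, "GAATTC", 1))
```

A difference in the computed patterns corresponds to the gain or loss of a cleavage site; a mutation outside every recognition site would not be detected by this approach.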

In other embodiments, genetic mutations can be identified by hybridizing a sample
and control nucleic acids, e.g., DNA or RNA, to high density arrays containing hundreds
or thousands of oligonucleotide probes (Cronin et al. (1996) Human Mutation 7:244-255;
Kozal et al. (1996) Nature Medicine 2:753-759). For example, genetic mutations can be
identified in two-dimensional arrays containing light-generated DNA probes as described
in Cronin et al., supra. Briefly, a first hybridization array of probes can be used to scan
through long stretches of DNA in a sample and control to identify base changes between
the sequences by making linear arrays of sequential overlapping probes. This step allows
the identification of point mutations. This step is followed by a second hybridization array
that allows the characterization of specific mutations by using smaller, specialized probe
arrays complementary to all variants or mutations detected. Each mutation array is
composed of parallel probe sets, one complementary to the wild-type gene and the other
complementary to the mutant gene.

In yet another embodiment, any of a variety of sequencing reactions known in the
art can be used to directly sequence the selected gene and detect mutations by comparing
the sequence of the sample nucleic acids with the corresponding wild-type (control)
sequence. Examples of sequencing reactions include those based on techniques developed
by Maxam and Gilbert ((1977) Proc. Natl. Acad. Sci. USA 74:560) or Sanger ((1977)
Proc. Natl. Acad. Sci. USA 74:5463). It is also contemplated that any of a variety of
automated sequencing procedures can be utilized when performing the diagnostic assays
((1995) Bio/Techniques 19:448), including sequencing by mass spectrometry (see, e.g.,
PCT Publication No. WO 94/16101; Cohen et al. (1996) Adv. Chromatogr. 36:127-162;
and Griffin et al. (1993) Appl. Biochem. Biotechnol. 38:147-159).
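
The comparison of a sample sequence with the corresponding wild-type sequence can be sketched as follows. This is an illustrative sketch only: it assumes the two sequences are already aligned and of equal length, so it reports substitutions but not insertions or deletions. The function name and the example sequences are hypothetical.

```python
def find_substitutions(wild_type: str, sample: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, wild-type base, sample base) for every
    position at which two pre-aligned, equal-length sequences differ."""
    if len(wild_type) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (i + 1, w, s)
        for i, (w, s) in enumerate(zip(wild_type.upper(), sample.upper()))
        if w != s
    ]


# Hypothetical example: a single G-to-A substitution at position 4.
print(find_substitutions("ATGGCC", "ATGACC"))
```

In practice, sequencing reads would first be aligned to the control sequence with a dedicated alignment tool before a comparison of this kind is made.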

Other methods for detecting mutations in a selected gene include methods in which
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or
RNA/DNA heteroduplexes (Myers et al. (1985) Science 230:1242). In general, the
technique of mismatch cleavage entails providing heteroduplexes formed by hybridizing
(labeled) RNA or DNA containing the wild-type sequence with potentially mutant RNA or
DNA obtained from a tissue sample. The double-stranded duplexes are treated with an
agent which cleaves single-stranded regions of the duplex, such as those which will exist due to
basepair mismatches between the control and sample strands. RNA/DNA duplexes can be
treated with RNase to digest mismatched regions, and DNA/DNA hybrids can be treated
with S1 nuclease to digest mismatched regions.

In other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated
with hydroxylamine or osmium tetroxide and with piperidine in order to digest
mismatched regions. After digestion of the mismatched regions, the resulting material is
then separated by size on denaturing polyacrylamide gels to determine the site of
mutation. See, e.g., Cotton et al. (1988) Proc. Natl. Acad. Sci. USA 85:4397; Saleeba et al.
(1992) Methods Enzymol. 217:286-295. In a preferred embodiment, the control DNA or
RNA can be labeled for detection.

In still another embodiment, the mismatch cleavage reaction employs one or more
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations
in cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells
cleaves T at G/T mismatches (Hsu et al. (1994) Carcinogenesis 15:1657-1662).
According to an exemplary embodiment, a probe based on a selected sequence, e.g., a
wild-type sequence, is hybridized to a cDNA or other DNA product from a test cell(s).
The duplex is treated with a DNA mismatch repair enzyme, and the cleavage products, if
any, can be detected from electrophoresis protocols or the like. See, e.g., U.S. Patent No.
5,459,039.

In other embodiments, alterations in electrophoretic mobility will be used to
identify mutations in genes. For example, single strand conformation polymorphism
(SSCP) may be used to detect differences in electrophoretic mobility between mutant and
wild type nucleic acids (Orita et al. (1989) Proc. Natl. Acad. Sci. USA 86:2766; see also
Cotton (1993) Mutat. Res. 285:125-144; Hayashi (1992) Genet. Anal. Tech. Appl.
9:73-79). Single-stranded DNA fragments of sample and control nucleic acids will be
denatured and allowed to renature. The secondary structure of single-stranded nucleic
acids varies according to sequence, and the resulting alteration in electrophoretic mobility
enables the detection of even a single base change. The DNA fragments may be labeled or
detected with labeled probes. The sensitivity of the assay may be enhanced by using RNA
(rather than DNA), in which the secondary structure is more sensitive to a change in
sequence. In a preferred embodiment, the subject method utilizes heteroduplex analysis to
separate double stranded heteroduplex molecules on the basis of changes in
electrophoretic mobility (Keen et al. (1991) Trends Genet. 7:5).

In yet another embodiment, the movement of mutant or wild-type fragments in
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing
gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When DGGE
is used as the method of analysis, DNA will be modified to ensure that it does not
completely denature, for example by adding a GC clamp of approximately 40 bp of
high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is
used in place of a denaturing gradient to identify differences in the mobility of control and
sample DNA (Rosenbaum and Reissner (1987) Biophys. Chem. 265:12753).

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al. (1986) Nature
324:163; Saiki et al. (1989) Proc. Natl. Acad. Sci. USA 86:6230). Such allele specific
oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective
PCR amplification may be used in conjunction with the instant invention.
Oligonucleotides used as primers for specific amplification may carry the mutation of
interest in the center of the molecule (so that amplification depends on differential
hybridization) (Gibbs et al. (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3'
end of one primer where, under appropriate conditions, mismatch can prevent or reduce
polymerase extension (Prossner (1993) Tibtech 11:238). In addition, it may be desirable
to introduce a novel restriction site in the region of the mutation to create cleavage-based
detection (Gasparini et al. (1992) Mol. Cell Probes 6:1). It is anticipated that in certain
embodiments amplification may also be performed using Taq ligase for amplification
(Barany (1991) Proc. Natl. Acad. Sci. USA 88:189). In such cases, ligation will occur
only if there is a perfect match at the 3' end of the 5' sequence making it possible to detect
the presence of a known mutation at a specific site by looking for the presence or absence
of amplification.
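
The principle of placing the mutation at the extreme 3' end of a primer can be illustrated with a deliberately idealized sketch. It assumes a forward primer whose sequence is identical to the top strand of the target, and it treats extension as all-or-nothing depending only on the 3'-terminal base, which is a simplification of real polymerase behavior. The function names and sequences are hypothetical.

```python
def three_prime_base_matches(primer: str, target_top_strand: str, variant_index: int) -> bool:
    """Idealized check: does the 3'-terminal base of a forward primer match the
    target base at `variant_index` (0-based), where the primer's 3' end sits?"""
    start = variant_index - len(primer) + 1
    if start < 0 or variant_index >= len(target_top_strand):
        raise ValueError("primer does not fit on the target at this position")
    return primer[-1] == target_top_strand[variant_index]


def call_allele(target: str, wt_primer: str, mut_primer: str, variant_index: int) -> str:
    """Report which allele-specific primer would be extended in this idealized model."""
    wt = three_prime_base_matches(wt_primer, target, variant_index)
    mut = three_prime_base_matches(mut_primer, target, variant_index)
    if wt and not mut:
        return "wild-type"
    if mut and not wt:
        return "mutant"
    return "undetermined"


# Hypothetical targets differing at index 4 (T in wild type, A in mutant).
wild_type_target = "ACGTTGCA"
mutant_target = "ACGTAGCA"
print(call_allele(wild_type_target, "CGTT", "CGTA", 4))
print(call_allele(mutant_target, "CGTT", "CGTA", 4))
```

The same 3'-terminal logic applies to the ligation-based approach described above, in which joining occurs only when the 3' end of the upstream oligonucleotide is perfectly matched.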

The methods described herein may be performed, for example, by utilizing
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody
reagent described herein, which may be conveniently used, e.g., in clinical settings to
diagnose patients exhibiting symptoms or family history of a disease or illness involving a
gene encoding a polypeptide of the invention. Furthermore, any cell type or tissue, e.g.,
chondrocytes, in which the polypeptide of the invention is expressed may be utilized in the
prognostic assays described herein.

3. Pharmacogenomics 

Agents, or modulators, which have a stimulatory or inhibitory effect on activity or
expression of a polypeptide of the invention, as identified by a screening assay described
herein, can be administered to individuals to treat (prophylactically or therapeutically)
disorders associated with aberrant activity of the polypeptide. In conjunction with such
treatment, the pharmacogenomics (i.e., the study of the relationship between an
individual's genotype and that individual's response to a foreign compound or drug) of the
individual may be considered. Differences in metabolism of therapeutics can lead to
severe toxicity or therapeutic failure by altering the relation between dose and blood
concentration of the pharmacologically active drug. Thus, the pharmacogenomics of the
individual permits the selection of effective agents (e.g., drugs) for prophylactic or
therapeutic treatments based on a consideration of the individual's genotype. Such
pharmacogenomics can further be used to determine appropriate dosages and therapeutic
regimens. Accordingly, the activity of a polypeptide of the invention, expression of a
nucleic acid of the invention, or mutation content of a gene of the invention in an
individual can be determined to thereby select appropriate agent(s) for therapeutic or
prophylactic treatment of the individual.

Pharmacogenomics deals with clinically significant hereditary variations in the
response to drugs due to altered drug disposition and abnormal action in affected persons.
See, e.g., Linder (1997) Clin. Chem. 43(2):254-266. In general, two types of
pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a
single factor altering the way drugs act on the body are referred to as "altered drug action".
Genetic conditions transmitted as single factors altering the way the body acts on drugs are
referred to as "altered drug metabolism". These pharmacogenetic conditions can occur
either as rare defects or as polymorphisms. For example, glucose-6-phosphate
dehydrogenase deficiency (G6PD) is a common inherited enzymopathy in which the main
clinical complication is haemolysis after ingestion of oxidant drugs (anti-malarials,
sulfonamides, analgesics, nitrofurans) and consumption of fava beans.

As an illustrative embodiment, the activity of drug metabolizing enzymes is a
major determinant of both the intensity and duration of drug action. The discovery of
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation
as to why some patients do not obtain the expected drug effects or show exaggerated drug
response and serious toxicity after taking the standard and safe dose of a drug. These
polymorphisms are expressed in two phenotypes in the population, the extensive
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among
different populations. For example, the gene coding for CYP2D6 is highly polymorphic
and several mutations have been identified in PM, which all lead to the absence of
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently
experience exaggerated drug response and side effects when they receive standard doses.
If a metabolite is the active therapeutic moiety, a PM will show no therapeutic response,
as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed
metabolite morphine. At the other extreme are the so-called ultra-rapid metabolizers who do
not respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism
has been identified to be due to CYP2D6 gene amplification.

Thus, the activity of a polypeptide of the invention, expression of a nucleic acid
encoding the polypeptide, or mutation content of a gene encoding the polypeptide in an
individual can be determined to thereby select appropriate agent(s) for therapeutic or
prophylactic treatment of the individual. In addition, pharmacogenetic studies can be used
to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the
identification of an individual's drug responsiveness phenotype. This knowledge, when
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and
thus enhance therapeutic or prophylactic efficiency when treating a subject with a
modulator of activity or expression of the polypeptide, such as a modulator identified by
one of the exemplary screening assays described herein.

4. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of a polypeptide of the invention (e.g., the ability to modulate aberrant cell 
proliferation, chemotaxis, and/or differentiation) can be applied not only in basic drug
screening, but also in clinical trials. For example, the effectiveness of an agent, as 
determined by a screening assay as described herein, to increase gene expression, protein 
levels or protein activity, can be monitored in clinical trials of subjects exhibiting 
decreased gene expression, protein levels, or protein activity. Alternatively, the 
effectiveness of an agent, as determined by a screening assay, to decrease gene expression, 
protein levels or protein activity, can be monitored in clinical trials of subjects exhibiting 
increased gene expression, protein levels, or protein activity. In such clinical trials, 
expression or activity of a polypeptide of the invention and, preferably, that of other
polypeptides that have been implicated in, for example, a cellular proliferation disorder, can
be used as a marker of the immune responsiveness of a particular cell. 

For example, and not by way of limitation, genes, including those of the invention, 
that are modulated in cells by treatment with an agent (e.g., compound, drug or small 
molecule) which modulates activity or expression of a polypeptide of the invention (e.g., 
as identified in a screening assay described herein) can be identified. Thus, to study the 
effect of agents on cellular proliferation disorders, for example, in a clinical trial, cells can 
be isolated and RNA prepared and analyzed for the levels of expression of a gene of the 
invention and other genes implicated in the disorder. The levels of gene expression (i.e., a 
gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 
described herein, or alternatively by measuring the amount of protein produced, by one of 
the methods as described herein, or by measuring the levels of activity of a gene of the 
invention or other genes. In this way, the gene expression pattern can serve as a marker, 
indicative of the physiological response of the cells to the agent. Accordingly, this 
response state may be determined before, and at various points during, treatment of the 
individual with the agent. 

In a preferred embodiment, the present invention provides a method for monitoring
the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist,
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate
identified by the screening assays described herein) comprising the steps of (i) obtaining a
pre-administration sample from a subject prior to administration of the agent; (ii) detecting
the level of the polypeptide or nucleic acid of the invention in the pre-administration
sample; (iii) obtaining one or more post-administration samples from the subject; (iv)
detecting the level of the polypeptide or nucleic acid of the invention in the
post-administration samples; (v) comparing the level of the polypeptide or nucleic acid of
the invention in the pre-administration sample with the level of the polypeptide or nucleic
acid of the invention in the post-administration sample or samples; and (vi) altering the
administration of the agent to the subject accordingly. For example, increased
administration of the agent may be desirable to increase the expression or activity of the
polypeptide to higher levels than detected, i.e., to increase the effectiveness of the agent.
Alternatively, decreased administration of the agent may be desirable to decrease
expression or activity of the polypeptide to lower levels than detected, i.e., to decrease the
effectiveness of the agent.
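
Steps (v) and (vi) above can be sketched as a simple comparison. This is a hedged illustration only, assuming an agent that raises the level of the polypeptide or nucleic acid, levels reported in arbitrary but consistent units, and a hypothetical target level and tolerance chosen by the investigator; none of these values come from the specification.

```python
def suggest_adjustment(pre_level: float, post_levels: list[float],
                       target_level: float, tolerance: float = 0.1) -> tuple[float, str]:
    """Compare pre- and post-administration levels (step v) and suggest how
    administration might be altered (step vi) for an agent assumed to raise
    the measured level. Returns (change from baseline, suggested action)."""
    if not post_levels:
        raise ValueError("at least one post-administration sample is required")
    latest = post_levels[-1]
    change = latest - pre_level
    if latest < target_level * (1 - tolerance):
        action = "increase administration"
    elif latest > target_level * (1 + tolerance):
        action = "decrease administration"
    else:
        action = "maintain administration"
    return change, action


# Hypothetical measurements in arbitrary units.
print(suggest_adjustment(pre_level=1.0, post_levels=[1.2, 1.5], target_level=2.0))
```

For an agent that lowers the measured level, the two adjustment branches would be reversed.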



C. Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods of
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated
with aberrant expression or activity of a polypeptide of the invention, as discussed, for
example, in sections above relating to uses of the sequences of the invention.

For example, disorders characterized by aberrant expression or activity of the
polypeptides of the invention include immunologic disorders, coronary disorders,
pulmonary disorders, neurological disorders, kidney disorders, and autoimmune disorders.
The nucleic acids, polypeptides, and modulators thereof of the invention can be used to
treat immunologic diseases and disorders, including but not limited to, allergic disorders
(e.g., anaphylaxis and allergic asthma), and autoimmune and inflammatory disorders (e.g.,
atopic dermatitis). Polypeptides of the invention can be used to treat diseases associated
with bacterial infection (e.g., tuberculosis, e.g., pulmonary tuberculosis), inflammatory
arthropathy, and bone and cartilage degenerative diseases and disorders (e.g., arthritis,
e.g., rheumatoid arthritis). Polypeptides of the invention can be used to treat pulmonary
disorders such as lung cancer, cystic fibrosis and rheumatoid lung diseases. Polypeptides
of the invention can be used to treat coronary disorders, such as ischemic heart disease,
atherosclerosis and plaque formation. Polypeptides of the invention can also be used to
treat neurological disorders such as neurodegenerative disorders, neuromuscular disorders
and cognitive disorders. Polypeptides of the invention can also be used to treat kidney
disorders such as glomerulonephritis and end stage renal disease. Further, polypeptides of
the invention can be used to treat autoimmune disorders such as Crohn's disease, and other
disorders described herein.



1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a
disease or condition associated with an aberrant expression or activity of a polypeptide of
the invention, by administering to the subject an agent which modulates expression or at
least one activity of the polypeptide. Subjects at risk for a disease which is caused or
contributed to by aberrant expression or activity of a polypeptide of the invention can be
identified by, for example, any or a combination of diagnostic or prognostic assays as
described herein. Administration of a prophylactic agent can occur prior to the
manifestation of symptoms characteristic of the aberrancy, such that a disease or disorder
is prevented or, alternatively, delayed in its progression. Depending on the type of
aberrancy, for example, an agonist or antagonist agent can be used for treating the subject.
For example, an antagonist of a TANGO 253, TANGO 257, INTERCEPT 258 or TANGO
281 protein may be used to treat an immunologic disorder, e.g., rheumatoid arthritis. The
appropriate agent can be determined based on screening assays described herein.



2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating expression or 
activity of a polypeptide of the invention for therapeutic purposes. The modulatory 
method of the invention involves contacting a cell with an agent that modulates one or 
more of the activities of the polypeptide. An agent that modulates activity can be an agent 
as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate 
ligand of the polypeptide, a peptide, a peptidomimetic, or other small molecule. In one 
embodiment, the agent stimulates one or more of the biological activities of the 
polypeptide. Examples of such stimulatory agents include the active polypeptide of the 
invention and a nucleic acid molecule encoding the polypeptide of the invention that has 
been introduced into the cell. In another embodiment, the agent inhibits one or more of 
the biological activities of the polypeptide of the invention. Examples of such inhibitory 
agents include antisense nucleic acid molecules and antibodies. These modulatory 
methods can be performed in vitro (e.g., by culturing the cell with the agent) or,
alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present
invention provides methods of treating an individual afflicted with a disease or disorder
characterized by aberrant expression or activity of a polypeptide of the invention. In one
embodiment, the method involves administering an agent (e.g., an agent identified by a
screening assay described herein), or combination of agents that modulates (e.g., 
upregulates or downregulates) expression or activity. In another embodiment, the method 
involves administering a polypeptide of the invention or a nucleic acid molecule of the 
invention as therapy to compensate for reduced or aberrant expression or activity of the 
polypeptide. 

Stimulation of activity is desirable in situations in which activity or expression is 
abnormally low or downregulated and/or in which increased activity is likely to have a 
beneficial effect. Conversely, inhibition of activity is desirable in situations in which 
activity or expression is abnormally high or upregulated and/or in which decreased activity 
is likely to have a beneficial effect. 

This invention is further illustrated by the following examples which should not be 
construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are hereby incorporated by reference. 

Deposit of Clones 

Clones containing cDNA molecules encoding human TANGO 253 (clone
EpT253), human TANGO 257 (clone EpT257), human INTERCEPT 258 (clone EpT258) and
human TANGO 281 (clone EpT281) were deposited with the American Type Culture
Collection, 10801 University Boulevard, Manassas, VA 20110-2209, on April 21, 1999
as Accession Number 207222, as part of a composite deposit representing a mixture of
strains, each carrying one recombinant plasmid harboring a particular cDNA clone.

For this composite deposit, to distinguish the strains and isolate a strain harboring 
a particular cDNA clone, an aliquot of the mixture can be streaked out to single colonies 
on nutrient medium (e.g., LB plates) supplemented with 100 µg/ml ampicillin, single 
colonies grown, and then plasmid DNA extracted using a standard minipreparation 
procedure. Next, a sample of the DNA minipreparation can be digested with a 
combination of the restriction enzymes SalI, NotI, XbaI and EcoRV and the resultant 
products resolved on a 0.8% agarose gel using standard DNA electrophoresis conditions. 
The digest liberates fragments as follows: 

Human TANGO 253 (clone EpT253): 1.3 kb 
Human TANGO 257 (clone EpT257): 1.8 kb 
Human INTERCEPT 258 (clone EpT258): 1.0 kb and 0.85 kb (human 
INTERCEPT 258 has an EcoRV cut site at about bp 1004). 
Human TANGO 281 (clone EpT281): 0.9 kb and 0.9 kb (human TANGO 
281 has an XbaI cut site at about bp 900). 

The identity of the strains can be inferred from the fragments liberated. 
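
The inference step described above amounts to comparing the bands observed on the gel with the expected fragment sizes listed for each clone. The following is a minimal Python sketch of that lookup, not part of the deposit procedure itself. It assumes that vector backbone bands are ignored, that co-migrating fragments (the two 0.9 kb fragments of EpT281) appear as a single band, and that a sizing tolerance of 0.05 kb is acceptable; the function name, the tolerance, and the data layout are illustrative choices.

```python
# Expected insert fragments (kb) for the composite deposit, as listed above.
EXPECTED_FRAGMENTS_KB = {
    "EpT253": (1.3,),
    "EpT257": (1.8,),
    "EpT258": (1.0, 0.85),
    "EpT281": (0.9, 0.9),
}


def identify_clone(observed_bands_kb, tolerance_kb=0.05):
    """Return the clones whose expected bands match the observed band sizes.

    Assumption: fragments of equal size run as one band, so expected
    fragments are collapsed to distinct sizes before comparison.
    """
    observed = sorted(set(observed_bands_kb))
    matches = []
    for clone, fragments in EXPECTED_FRAGMENTS_KB.items():
        expected = sorted(set(fragments))
        if len(expected) != len(observed):
            continue
        if all(abs(o - e) <= tolerance_kb for o, e in zip(observed, expected)):
            matches.append(clone)
    return matches


print(identify_clone([1.3]))        # single 1.3 kb band
print(identify_clone([0.85, 1.0]))  # two bands from the EcoRV-cut insert
print(identify_clone([0.9]))        # 0.9 kb doublet seen as one band
```

An empty result indicates that the observed pattern matches none of the listed clones within the assumed tolerance.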

Clones containing cDNA molecules encoding mouse INTERCEPT 258 were 
deposited with the American Type Culture Collection (Manassas, VA) on April 21, 1999 
as Accession Number 207221, as part of a composite deposit representing a mixture of 
five strains, each carrying one recombinant plasmid harboring a particular cDNA clone. 

To distinguish the strains and isolate a strain harboring a particular cDNA clone, 
an aliquot of the mixture can be streaked out to single colonies on nutrient medium (e.g., 
LB plates) supplemented with 100 µg/ml ampicillin, single colonies grown, and then 
plasmid DNA extracted using a standard minipreparation procedure. Next, a sample of the 
DNA minipreparation can be digested with a combination of the restriction enzymes SalI 
and NotI, and the resultant products resolved on a 0.8% agarose gel using standard DNA 
electrophoresis conditions. The digest liberates fragments as follows: 

Mouse INTERCEPT 258 (clone EpT258): 1.8 kb 

The identity of the strains can be inferred from the fragments liberated. 

A clone containing a cDNA molecule encoding mouse TANGO 253 (clone EpTm253) 
was deposited with the American Type Culture Collection, 10801 University Boulevard, 
Manassas, VA 20110-2209, on April 21, 1999 as Accession Number 207215. 

A clone containing a cDNA molecule encoding mouse TANGO 257 (clone EpTm257) 
was deposited with the American Type Culture Collection, 10801 University Boulevard, 
Manassas, VA 20110-2209, on April 21, 1999 as Accession Number 207217. 

A clone containing a cDNA molecule encoding mouse TANGO 281 (clone EpTm281) 
was deposited with the American Type Culture Collection, 10801 University Boulevard, 
Manassas, VA 20110-2209, on June 15, 1999 as patent deposit Number PTA-224. 

All publications, patents and patent applications mentioned in this specification are 
herein incorporated by reference into the specification to the same extent as if each 
individual publication, patent or patent application was specifically and individually 
indicated to be incorporated herein by reference. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 
What is claimed is: 

1. An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:1, 2, 26, 27, 46, 47, the cDNA 
insert of the plasmid deposited with the ATCC® as Accession Number 207222, or a 
complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:1, 2, 15, 16, 26, 27, 46, 47, the cDNA insert of 
the plasmid deposited with the ATCC® as Accession Number 207222, or a complement 
thereof; 

c) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 17, 28, 48, or the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207222; 

d) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:3, 28, 48, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207222, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO:3, 28, 48, or the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207222; 

e) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:3, 17, 28, 48, or the 
amino acid sequence encoded by the cDNA insert of the plasmid deposited with the 
ATCC® as Accession Number 207222, wherein the nucleic acid molecule hybridizes to a 
nucleic acid molecule comprising SEQ ID NO:2, 16, 27, 47, or a complement thereof 
under stringent conditions; 

f) a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to the nucleotide sequence of SEQ ID NO:21, 22, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207217, or a complement 
thereof; 

g) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:21, 22, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207217, or a complement thereof; 
h) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:23, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207217; 

i) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:23, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207217, wherein the fragment comprises at least 360 contiguous amino acids of 
SEQ ID NO:23, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207217; 

j) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:23, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207217, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:22, or a complement thereof under stringent conditions; 

k) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:37, 38, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207221, or a complement 
thereof; 

l) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:37, 38, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207221, or a complement thereof; 

m) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:39, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207221; 

n) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:39, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207221, wherein the fragment comprises at least 160 contiguous amino acids of 
SEQ ID NO:39, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207221; 

o) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:39, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207217, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:38, or a complement thereof under stringent conditions; 

p) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:8, 9, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207215, or a complement 
thereof; 

q) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:8, 9, the cDNA insert of the plasmid deposited 
with the ATCC® as Accession Number 207215, or a complement thereof; 

r) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:10, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207215; 

s) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:10, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207215, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO:10, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207215; 

t) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:10, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207215, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:9, or a complement thereof under stringent conditions; 

u) a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to the nucleotide sequence of SEQ ID NO:15, 16, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207222, or a complement 
thereof; 

v) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:17, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207222, wherein the fragment comprises at least 360 contiguous amino acids of 
SEQ ID NO:17, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207222. 

w) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:56, 57, the cDNA insert of the 
plasmid deposited with the ATCC® as patent deposit Number PTA-224, or a complement 
thereof; 

x) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:56, 57, the cDNA insert of the plasmid 
deposited with the ATCC® as patent deposit Number PTA-224, or a complement thereof; 

y) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:58, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as patent deposit Number PTA-224; 

z) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:58, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as patent deposit 
Number PTA-224, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO:58, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as patent deposit Number PTA-224; 

aa) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:58, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as patent 
deposit Number PTA-224, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:57, or a complement thereof under stringent conditions. 

2. The isolated nucleic acid molecule of claim 1, which is selected from the 
group consisting of: 

a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:1, 2, 15, 
16, 26, 27, 46, 47, the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207222, or a complement thereof; 

b) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 17, 28, 48, or the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207222; 

c) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:21, 22, 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207217, 
or a complement thereof; 
d) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:23, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207217; 

e) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:37, 38, 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207221, 
or a complement thereof; 

f) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:39, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207221; 

g) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:8, 9, the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207215, or 
a complement thereof; 

h) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:10, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207222. 

i) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:56, 57, 
the cDNA insert of the plasmid deposited with the ATCC® as patent deposit Number 
PTA-224, or a complement thereof; 

j) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:58, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as patent deposit Number PTA-224. 

3. The nucleic acid molecule of claim 1 further comprising vector nucleic acid 
sequences. 

4. The nucleic acid molecule of claim 1 further comprising nucleic acid 
sequences encoding a heterologous polypeptide. 

5. A host cell which contains the nucleic acid molecule of claim 1. 

6. The host cell of claim 5 which is a mammalian host cell. 
7. A non-human mammalian host cell containing the nucleic acid molecule of 
claim 1. 

8. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence of SEQ ID 
NO:3, 10, 17, 23, 28, 39, 48, 58, wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58; 

b) a naturally occurring allelic variant of a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence 
encoded by the cDNA insert of plasmids deposited with the ATCC® as Accession 
Number 207222, Accession Number 207215, Accession Number 207217, Accession 
Number 207221, patent deposit Number PTA-224 wherein the polypeptide is encoded by 
a nucleic acid molecule which hybridizes to a nucleic acid molecule comprising SEQ ID 
NO:2, 9, 16, 22, 27, 38, 47, 57, or a complement thereof under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid molecule comprising a 
nucleotide sequence which is at least 45% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO:2, 9, 27, 38, 47, 57, or at least 98% to a nucleic acid 
comprising the nucleotide sequence of SEQ ID NO:2, 9, 27, 38, 47, 57, or a complement 
thereof. 

9. The isolated polypeptide of claim 8 comprising the amino acid sequence of 
SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58. 

10. The polypeptide of claim 8 further comprising heterologous amino acid 
sequences. 

11. An antibody which selectively binds to a polypeptide of claim 8. 

12. A method for producing a polypeptide selected from the group consisting of: 

a) a polypeptide comprising the amino acid sequence of SEQ ID NO:3, 10, 
17, 23, 28, 39, 48, 58, or the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207222, Accession Number 
207215, Accession Number 207217, Accession Number 207221, or patent deposit 
Number PTA-224; 

b) a polypeptide comprising a fragment of the amino acid sequence of SEQ ID 
NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207222, Accession 
Number 207215, Accession Number 207217, Accession Number 207221, or patent 
deposit Number PTA-224, wherein the fragment comprises at least 15 contiguous amino 
acids of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207222, 
Accession Number 207215, Accession Number 207217, Accession Number 207221 or 
patent deposit Number PTA-224; and 

c) a naturally occurring allelic variant of a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207222, Accession Number 207215, Accession Number 207217, Accession 
Number 207221, or patent deposit Number PTA-224, wherein the polypeptide is encoded 
by a nucleic acid molecule which hybridizes to a nucleic acid molecule comprising SEQ 
ID NO:1, 8, 15, 21, 26, 37, 46, 56, or a complement thereof under stringent conditions; 

comprising culturing the host cell of claim 5 under conditions in which the nucleic 
acid molecule is expressed. 

13. A method for detecting the presence of a polypeptide of claim 8 in a 
sample, comprising: 

a) contacting the sample with a compound which selectively binds to a 
polypeptide of claim 8; and 

b) determining whether the compound binds to the polypeptide in the sample. 

14. The method of claim 13, wherein the compound which binds to the 
polypeptide is an antibody. 

15. A kit comprising a compound which selectively binds to a polypeptide of 
claim 8 and instructions for use. 
16. A method for detecting the presence of a nucleic acid molecule of claim 1 
in a sample, comprising the steps of: 

a) contacting the sample with a nucleic acid probe or primer which selectively 
hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a nucleic 
acid molecule in the sample. 

17. The method of claim 16, wherein the sample comprises mRNA molecules 
and is contacted with a nucleic acid probe. 

18. A kit comprising a compound which selectively hybridizes to a nucleic acid 
molecule of claim 1 and instructions for use. 

19. A method for identifying a compound which binds to a polypeptide of 
claim 8 comprising the steps of: 

a) contacting a polypeptide, or a cell expressing a polypeptide of claim 8 with 
a test compound; and 

b) determining whether the polypeptide binds to the test compound. 

20. The method of claim 19, wherein the binding of the test compound to the 
polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detecting of test compound/polypeptide 
binding; 

b) detection of binding using a competition binding assay; 

c) detection of binding using an assay for TANGO 253, TANGO 257, 
INTERCEPT 258, or TANGO 281-mediated signal transduction. 

21. A method for modulating the activity of a polypeptide of claim 8 
comprising contacting a polypeptide or a cell expressing a polypeptide of claim 8 with a 
compound which binds to the polypeptide in a sufficient concentration to modulate the 
activity of the polypeptide. 
22. A method for identifying a compound which modulates the activity of a 
polypeptide of claim 8, comprising: 

a) contacting a polypeptide of claim 8 with a test compound; and 

b) determining the effect of the test compound on the activity of the 
polypeptide to thereby identify a compound which modulates the activity of the 
polypeptide. 
CACCAACTGGAGGGTCCGGAGTAGCGAGCGCCCCGAAGGAGGCCATCGGGGAG 
TGCCCACTCCAAAGTGAGCTCATGCICrCACTC^T^ 
CXXXXXCCTGGAATATTGTGAATGACTAG^^ 

FIG. 1A 

1/61 

WO 00/78808 

PCT/US00/16883 

ACTXSGCTGTCTGCGATCAGGTCTGGC^ 1156 
CK3CAAGTGTAAGTCCCCCAGTTGCTCTGGTCCAGGAGCC 1235 
ATCCTCCCCACCCCCTCCTGCTCCTGGGGCCGGCCCTTTTCTCAGAGATCACTCAATAAACCTAAGAACCCTCCAAAAA 1314 
AAAAAAAAAAAAAAAGGGCGGCCGC 1339 

MRPLLVLLLLGLAAGSPPLDDNKIPSLCPGHPGLPGTPGHHGSQGLPGRDGRDGRDGAPG 
APGEKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSD 
APLPFDRVLVNEQGHYDAVTGKFTCQVPGVYYFAVHATVYRASLQFDLVKNGESIASFFQ 
FFGGWPKPASLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDSTFSGFLVYSDWHSSP 
VFA 
GTCGACCCACGCGTCCGCGCTGTGAAGCCAGCAAGGAGCAACCAGAAGCTAGGAGTCAGTCAGCAAGGACAGGGGCTGC  79 

                                                        M   R   P   L   L   A    6 
CTGCCTACAGACTACAAGAGAGGTTCCTGGAGTCTGAGCCTCCGGGGTCACCACC ATG AGG CCA CTT CTT GCC  152 

L   L   L   L   G   L   V   S   G   S   P   P   L   D   D   N   K   I   P   S    26 
CTT CTG CTT CTG GGT CTG GTG TCA GGC TCT CCT CCT CTG GAC GAC AAC AAG ATC CCC AGC  212 

L   C   P   G   Q   P   G   L   P   G   T   P   G   H   H   G   S   Q   G   L    46 
CTG TGT CCC GGG CAG CCC GGC CTT CCA GGC ACA CCA GGT CAC CAT GGC AGC CAA GGC CTG  272 

P   G   R   D   G   R   D   G   R   D   G   A   P   G   A   P   G   E   K   G    66 
CCT GGC CGT GAC GGC CGT GAT GGC CGC GAC GGT GCA CCC GGA GCT CCG GGA GAG AAA GGC  332 

E   G   G   R   P   G   L   P   G   P   R   G   E   P   G   P   R   G   E   A    86 
GAG GGC GGG AGA CCG GGA CTA CCT GGC CCA CGT GGG GAG CCC GGG CCG CGT GGA GAG GCA  392 

G   P   M   G   A   I   G   P   A   G   E   C   S   V   P   P   R   S   A   F    106 
GGG CCC ATG GGG GCT ATC GGG CCT GCG GGG GAG TGC TCG GTA CCC CCA CGA TCA GCC TTC  452 

S   A   K   R   S   E   S   R   V   P   P   P   A   D   T   P   L   P   F   D    126 
AGT GCC AAG CGA TCC GAG AGC CGG GTA CCT CCG CCA GCC GAC ACA CCC CTA CCT TTC GAC  512 

R   V   L   L   N   E   Q   G   H   Y   D   P   T   T   G   K   F   T   C   Q    146 
CGT GTG CTG CTA AAT GAG CAG GGC CAT TAC GAC CCC ACT ACT GGC AAG TTC ACC TGC CAA  572 

V   P   G   V   Y   Y   F   A   V   H   A   T   V   Y   R   A   S   L   Q   F    166 
GTG CCT GGC GTC TAC TAC TTT GCT GTG CAC GCC ACT GTC TAC CGG GCC AGC TTG CAG TTT  632 

D   L   V   K   N   G   Q   S   I   A   S   F   F   Q   Y   F   G   G   W   P    186 
GAT CTT GTC AAA AAC GGG CAG TCC ATC GCC TCT TTC TTC CAG TAT TTT GGG GGG TGG CCC  692 

K   P   A   S   L   S   G   G   A   M   V   R   L   E   P   E   D   Q   V   W    206 
AAG CCA GCC TCG CTC TCA GGG GGT GCG ATG GTA AGG CTA GAA CCT GAG GAC CAG GTG TGG  752 

V   Q   V   G   V   G   D   Y   I   G   I   Y   A   S   I   K   T   D   S   T    226 
GTG CAG GTG GGC GTG GGT GAT TAC ATT GGC ATC TAT GCC AGC ATC AAG ACA GAC AGT ACC  812 

F   S   G   F   L   V   Y   S   D   W   H   S   S   P   V   F   A   *    244 
TTC TCT GGA TTT CTC GTC TAT TCT GAC TGG CAC AGC TCC CCA GTC TTC GCT TAA  866 

AACACAGTGAACCCGGAGCTGGCACTTGCT 945 

TGGCCCCCTGGAATATTGTGAATGACTTAGGAAG 1024 

fiGCTCTCTGAGGTCAAGACAGCGTGGAGCAGTGGCTC I 103 

TtX3GTCCTGGCCCAGGACTCCAAGGTGGGATGC^ 1182 
GCIXXntXX^GCKX^GGCCTTTTTCT^ 1251 
<jc 1263 
>mT253 

MRPLLALLLLGLVSGSPPLDDNKIPSLCPGQPGLPGTPGHHGSQGLPGRDGRDGRDGAPG 
APGEKGEGGRPGLPGPRGEPGPRGEAGPMGAIGPAGECSVPPRSAFSAKRSESRVPPPAD 
TPLPFDRVLLNEQGHYDPTTGKFTCQVPGVYYFAVHATVYRASLQFDLVKNGQSIASFFQ 
YFGGWPKPASLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDSTFSGFLVYSDWHSSP 
VFA 
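
The conceptual translation of the mouse TANGO 253 open reading frame shown in the preceding figure can be reproduced with the standard genetic code. The following is a minimal Python sketch, not the method used to prepare the figure: the function and variable names are illustrative, and only the first 26 codons and the last 9 codons of the open reading frame are transcribed here from the figure.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(orf):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    orf = orf.replace(" ", "").upper()
    protein = []
    for i in range(0, len(orf) - 2, 3):
        amino_acid = CODON_TABLE[orf[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Codons transcribed from the figure (start of the ORF and its final codons).
ORF_START = (
    "ATG AGG CCA CTT CTT GCC "
    "CTT CTG CTT CTG GGT CTG GTG TCA GGC TCT "
    "CCT CCT CTG GAC GAC AAC AAG ATC CCC AGC"
)
ORF_END = "TGG CAC AGC TCC CCA GTC TTC GCT TAA"

print(translate(ORF_START))  # MRPLLALLLLGLVSGSPPLDDNKIPS
print(translate(ORF_END))    # WHSSPVFA
```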
ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT253 a.a. 243 aa vs. 
> mT253 a.a. 243 aa 
scoring matrix: pam120.mat, gap penalties: -12/-4 

93.8% identity; Global alignment score: 1239 

               10        20        30        40        50        60        70 
inputs MRPLLVLLLLGLAAGSPPLDDNKIPSLCPGHPGLPGTPGHHGSQGLPGRDGRDGRDGAPGAPGEKGEGGR 
       MRPLLALLLLGLVSGSPPLDDNKIPSLCPGQPGLPGTPGHHGSQGLPGRDGRDGRDGAPGAPGEKGEGGR 
               10        20        30        40        50        60        70 

               80        90       100       110       120       130       140 
inputs PGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSDAPLPFDRVLVNEQGHYDAVT 
       ::::::::.::::::::: :..::::::::::::::::::::::::::.:.::::::::.:::::::..: 
       PGLPGPRGEPGPRGEAGPMGAIGPAGECSVPPRSAFSAKRSESRVPPPADTPLPFDRVLLNEQGHYDPTT 
               80        90       100       110       120       130       140 

              150       160       170       180       190       200       210 
inputs GKFTCQVPGVYYFAVHATVYRASLQFDLVKNGESIASFFQFFGGWPKPASLSGGAMVRLEPEDQVWVQVG 
       ::::::::::::::::::::::::::::::::.:::::::.::::::::::::::::::::::::::::: 
       GKFTCQVPGVYYFAVHATVYRASLQFDLVKNGQSIASFFQYFGGWPKPASLSGGAMVRLEPEDQVWVQVG 
              150       160       170       180       190       200       210 

              220       230       240 
inputs VGDYIGIYASIKTDSTFSGFLVYSDWHSSPVFA 
       VGDYIGIYASIKTDSTFSGFLVYSDWHSSPVFA 
              220       230       240 

FIG. 5 
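
The percent identity reported in FIG. 5 can be checked by counting identical positions, since the two 243-residue sequences align without gaps. The following is a minimal Python sketch of that count; it is not the ALIGN program and does not reproduce the PAM120 alignment score. The sequences are transcribed from the human and mouse TANGO 253 amino acid listings above, with the OCR repairs made there assumed correct, and the function name is illustrative.

```python
HUMAN_T253 = (
    "MRPLLVLLLLGLAAGSPPLDDNKIPSLCPGHPGLPGTPGHHGSQGLPGRDGRDGRDGAPG"
    "APGEKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSD"
    "APLPFDRVLVNEQGHYDAVTGKFTCQVPGVYYFAVHATVYRASLQFDLVKNGESIASFFQ"
    "FFGGWPKPASLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDSTFSGFLVYSDWHSSP"
    "VFA"
)
MOUSE_T253 = (
    "MRPLLALLLLGLVSGSPPLDDNKIPSLCPGQPGLPGTPGHHGSQGLPGRDGRDGRDGAPG"
    "APGEKGEGGRPGLPGPRGEPGPRGEAGPMGAIGPAGECSVPPRSAFSAKRSESRVPPPAD"
    "TPLPFDRVLLNEQGHYDPTTGKFTCQVPGVYYFAVHATVYRASLQFDLVKNGQSIASFFQ"
    "YFGGWPKPASLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDSTFSGFLVYSDWHSSP"
    "VFA"
)


def percent_identity(seq_a, seq_b):
    """Percent of identical positions between two equal-length, ungapped sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be the same length for an ungapped comparison")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


identity = percent_identity(HUMAN_T253, MOUSE_T253)
print(f"{identity:.1f}% identity over {len(HUMAN_T253)} residues")
```

For sequences of unequal length or with insertions and deletions, a gapped global alignment would be needed before identical positions can be counted.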
ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT253 a.a. 243 aa vs. 
> SwissProt Q15848 - (untitled) 244 aa 
scoring matrix: pam120.mat, gap penalties: -12/-4 

38.7% identity; Global alignment score: 262 

in 20 30 40 50 60 

inputs MRPL-LVLLLLGLAA- - -GSPPLDDNKIPSL- - - -CPG-HPGLPGTPGHHGSQGLPGRDGRDGRDGAPGA 

MLLLGA^LLLALPG^^ 

10 20 30 40 SO «> u 

•7ft 80 90 100 HO 120 130 

inputs PGEKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRyPPPSDAPLPTORVLVN 

PGLIGPKGDIGETGVPGAEGPRGFPGIQGRKG^ 

60 90 100 110 120 

inputs EQGHYDAVTCKFTCQVPGVYYFAVHATVYRASLQFD 

W^STC^ipG^FAY^li^^ 

170 180 190 



140 150 160 



210 220 230 240 

inputs EDQWA/QV-GVGDYIGIYASIKTDSTFSGFLVYSDWHSSPVFA 

GDQVVnifQVYGEGERNGLYAJDNDNDSTFTGFLLiY- - -HDT- - -N 
210 220 230 240 



FIG. 6A 
ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> mT253 a.a. 243 aa vs. 
> SwissProt Q15848 - (untitled) 244 aa 
scoring matrix: pam120.mat, gap penalties: -12/-4 

38.3% identity; Global alignment score: 264 

10 20 30 40 50 60 

inputs MRPLLALLLLGLVS GS PPLDDNKI PSL - CPG -QPGLPGTPGHHGSQGLPGRDGRDGRDGAPGA 

: ::.::: . . : . . . : . . : : : : : : : : . : . 

MLL1XJAVT.LLLALPGHDQETTTQGP 

10 20 30 40 50 60 70 

70 80 90 100 110 120 130 

inputs PGEKGEGGRPGLPGPRGEPGPRGEAGPMGAIGPAGECSVPPRSAFSAKRSESRVPPPADTPLPFDRVLLN 
:: : : : ; : :.::. :::::• :.: 

PGLIGPKGDIGETGVPGAEGPRGFPGIQGRKGEPGEGAYVYRS AFSVGL- ETYVTI P - NMPIRFTKI FYN 
80 90 100 110 120 130 

140 150 160 170 180 190 200 

inputs EQGHYDPTTGKFTCQVPGVYYFAVKATVYRASLQFDLV^ 

• ••••• • ■ • • • »••••••••• • • • • «• • • " • 

QQNHYDGSTGKFHCNI PGLYYFAYK ITVYMKDVKVS LFKKDKAMLFTYDQYQE - NNWQASGS VTjLHLEV 
140 150 160 170 180 190 200 

210 220 230 2£0 

inputs EIXJVWQV-GVGDYIGIYASIKTDSTFSGFLVVSDWHSSPVFA 

GDQVWLQVYGEGERNGLYADNDNDSTFTGFLLY HDT N 

210 220 230 240 



FIG. 6B 
ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT253 n.a. 1339 aa vs. 
> adipocyte n.a. (AI417523) 653 aa 
scoring matrix: pam120.mat, gap penalties: -12/-4 

29.1% identity; Global alignment score: -1168 

10 20 30 40 SO 60 70 

inputs GTCGACCCACGCGTCCGGGACTGGGGTGACGGCAGGGCAGGGGGCGCCTGGCCGGGGAGAAGCGCGGGGG 

: . ::.: : .::: :.. :. :::: :..: :* 55 :: - 

TTTTTT GCAT — GTAACTTTTTTATTGA GGCA C AAC AAGGC ATTGT AACTTG CCTGG A 

10 20 30 40 50 

60 90 100 110 120 130 140 

inputs CTGGAGCACCACCAACTGGAGGGTCCGGAGTAGCGAGCGCCCCGAAGGAGGCCATCGGGGAGCCGGGAGG 
: : : : : . : . : :.:.:::•::: : : : « : • s «•:••• 

CTTGAG GCAGT CAGTTTAGTAAGCT GAA- CGTTAATACAGTTAA 

60 70 80 90 

150 160 . 170 180 190 200 210 

inputs GGGGACTGCGAGAGGACCCCGGCGTCCGGGCTCCCGGTGCCAGCGCTATGAGGCCACTCCTCGTCCTGCT 

::::.:•:::. : : : : • : - 

GGATTAAG TGCAAACAATATA CATTC ACA 

100 HO 120 

220 230 240 250 260 270 280 

inputs GCTCCTGGGCCTGGCGGCCGGCTCGCCCCCACTGGACGACAACAAGATCCCCAGCCTCTGCCCGGGGCAC 



• . • • • 



GCT — TGA — CTAGCGA — GGCT ACATCA-CAATTTATAAAG TGCCAGA 

130 140 150 160 170 

290 300 310 320 330 340 350 

inputs CCCGGCCTTCCAGGCACGCCGGGCCACCATGGCAGCCAGGGCTTGCCGGGCCGCGATGGCCGCGACGGCC 



TT — AGT GCTAA TTGTCATTCA — GCTTG ATTTTTCAC 

180 190 200 

360 370 380 390 400 410 420 

inputs GCGACGGCGCGCCCGGGGCTCCGGGAGAGAAAGGCGAGGGCGGGAGGCCGGGACTGCCGGGACCTCGATO 

CTCAGGAAGGAAAA — CAAAAAAGTAAGG ACC TCCTC 

210 220 230 

430 440 450 460 470 480 490 

inputs GGACCCCGGGCCGCGAGGAGAGGCGGGACCCGCGGGGCCXIACCGGGC 
it : : :::. 

CCTCTAGGAA- 

240 

500 510 520 530 540 550 560 

inputs CCGCGATCCGCCTTCAGCX5CCAAGCGCTCCGA 

:.:.:.:• : s : : 
CAAAAAACATTTTCCT -- AAACCAA 

250 260 270 

570 580 590 600 610 620 630 

inputs TCGACCGCGTGCTGGTGAACGAGCAGGGACATTACGACGCCGTCACCGGCAAGTTCACCTGCCAGGTGCC 
• ::: . :: ::: • :: ::. 

TCAGTC ATGA — GGGCAAAGAC — TACTTTTCCTTCA ATC-CCA — CTAAT 

280 290 300 , 310 

640 650 660 670 680 690 700 

inputs TGGGGTCTACTACTTCGCCGTCCATGCCACCGTCTACCGGGCCAGCCTGCAGTTTGATCTGGTGAAGAAT 
s • * • • : : : : : : 

TAGAA CACCATCC TTTTAT T 

320 330 

710 720 730 740 750 760 770 

inputs GGCGAATCCATTGCCTCTTTCTTCCAGTTTTTCGGGGGGTGGCCCAAGCCAGCCTCGCTCTCGGGGGGGG 
5 s • 2 • • : . : j : ; . . : 

GTCAATACTGT ACTGACTTTCAAT CTTG 

340 350 360 

780 790 800 810 820 830 840 

inputs CCATGGTGAGGCTGGAGCCTGAGGACCAAGTGTGGGTGCAGGTGGGTGTGGGTGACTACATTGGCATCTA 



ATAAAGAAGAT — AG CCTG AAAAC GTAGAATAT 

370 380 390 

850 860 870 880 890 900 910 

inputs TGCCAGCATCAAGACAGACAGCACCTTCTCCGGATTTCTGGTGTACTCCGACTGGCACAGCTCCCCAGTC 
: 

TTCCAGCTACT TCCATAAAT TGCTCCCCTGT- 

400 410 420 

920 930 940 950 960 970 980 

inputs TTTGCTTAGTGCCCACTGCAAAGTGAGCTCATGCTCTCACTCCTAGAAGGAGGGTGTGAGGCTGACAACC 

GCAGACGT — 

430 

990 1000 1010 1020 1030 1040 1050 

inputs TGGTCATCCAGGAGGGCTGGCCCCCCTGGAATATTGTGAATGACTAGGGAGGTGGGGTAGAGCACTCTCC 

:::: : :::::::: x:: : 

7 AACCATAT CTGGTCTCCCTGGAA -GAGCTGAAGAATTGCATGAT — 

440 450 460 470 

1060 1070 1080 1090 1100 1110 1120 

inputs GTCCTGCTGCTGGCAAGGAATGGGAACAGTGGCTKJTCTG 

i ..:r in :nt t; u : ::ix 

TGCTAGCA GTTTCA-TGG TCTG-GAGCA C CATCATTGG-CATAGGCT 

480 490 500 510* 520 

1130 1140 1150 1160 1170 1180 1190 
inputs GGATTTCTGCCCAAGACCAGAGGAGTGTGCTGTGCT 

*:::::::• : . :::: .: ::: 

GATA CCAAGACCT CTT — CATTCTTCANTGAG GTTG-AC — ATACAG 

530 540 550 560 

1200 1210 1220 1230 1240 1250 1260 

inputs GAGCCCACGGTGGGGTGCTCTCTTCCTGGTCCTCTGCTTCTCTGGATCCTCCCCACCCCCTCCTGCTCCT 

***** «••••••••«••• • ••••• 

TGGCACAT TCACTGCCAG — CTTTTACATGTGAAAAA TGAAAAACGT 

570 580 590 600 

1270 1280 1290 1300 1310 1320 1330 

inpu t s GGGGCCGGCCCTTTTCTCAGAGATCACTCAATAAACCTAAGAACCCTCCAAAAAAAAAAAAAAAAAAAAG 

: : 

AGTGCCA TTCACTTGG — CA ATTAAATCTA CCAAAGCTGAGATCAAA — 

610 620 630 640 650 



inputs GGCGGCCGC 
ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> mT253 n.a. 1263 aa vs. 
> adipocyte n.a. (AI417523) 653 aa 
scoring matrix: pam120.mat, gap penalties: -12/-4 

30.4% identity; Global alignment score: -840 

10 20 30 40 50 . 60 70 

inputs GTCGACCCACGCGTCCGCGCTGTGAAGCCAGCAAGGAGCAACCAGAAGCTAGGAGTCAGTCAGCAAGGAC 



• • • ■ • 



TT TTT TGCATGTAACTT TTTTATTGAGGCA — CAACAAGG-C 

10 20 30 

80 90 100 110 120 130 140 

inputs AGGGGCTGCCTGCCTACAGACTACAAGAGAGGTTCCTGGAGTCTGAGCCTCCGGGGTCACCACCATGAGG 

• • • * • ••*••«• ••••«• 

• ■ ••••• •••«•*• ••••••• 

ATTG TAACT TGCCTGGA CTTGAGG 

40 50 60 

150 160 170 180 190 200 210 

inputs CCACTTCTTGCCCTTCTGCTTCTGGGTCTGGTGTCAGGCTCTCCTCCTCTGGACGACAACAAGATCCCCA 



CAG TCAGTTT AGTAAG CTGAACGTTAATA 

70 80 90 

220 230 240 250 260 270 280 

inputs GCCTGTGTCCCGGGCAGCCCGGCCTTCCAGGCACACCAGGTCACCATGGCAGCCAAGGCCTGCCTGGCCG 
: . : : . : : . : : •::::::.: • : : ::.::::::•::: 

— CAGTTA — AGGA TTAAGTGCAAACAATAT ACATTCACAGCTTGACTAGC-G 

100 110 120 130 140 

290 300 310 320 330 340 350 

inputs TGACGGCCGTGATGGCCGCGACGGTGCACCCGGAGCTCCGGGAGAGAAAGGCGAGGGCGGGAGACCGGGA 

::.: ::.:: :.:..: :. 

AGGCTAC ATCACAATTTATAAAGTGC CAGATTA GTG 

150 160 170 

360 370 380 390 400 410 420 

inputs CTACCTGGCCCACGTGGGGAGCCCGGGCCGCGTGGAGAGGCAGGGCCCAT 

:: : .:• :: ::•• . :: ::. :: ..:.. 

CTAATTGTCATTCA ■ GCTTGATTTTTCA CCTCAGGAA -GG AAAACAA 

180 190 200 210 220 

430 440 450 460 470 480 490 

inputs GGGAGTGCTCGGTACCCCGACX3ATCAGCCTTCAGTGCCAAGCX2ATCC 

...:::. x t : x i • : its : . : : t % • i •t.i.i • x . : : : 

AAAAGTA AGGACCTCCTC CCT -CTAG-GAACAAAAAAC-ATTTTCCTA 

230 240 250 260 

500 510 520 530 540 550 560 

inputs CGACACACXTCCTACCTTTCGACCGTC 

:.::.x ::.::.. :.: ::::::: : -t: 
AACCAATCAGTCATGAG-GGCAAAGACTACTTTTCCTT -- CAATCCCACTAAT TAG

270 280 290 300 310 

570 580 590 600 610 620 630 

inputs TTCACCTGCCAAGTGCCTGGCGTCTACTACTTTGCTGTGCACGCCACTGTCTACCGGGCCAGCTTGCAGT 
•«::::•::••:• :: 

AACACCATCCTTTTA — TTG TCAATACTGT ACTGACTT 

320 330 340 350 

640 650 660 670 680 690 700 

inputs TTGATCTTGTCAAAAACGGGCAGTCCATCGCCTCTTTCTTCCAGTATTTTGGGGGGTGGCCCAAGCCAGC 

TCAATCTT GATAAAGAAGATAGCC 

360 370 

710 720 730 740 750 760 770 

inputs CTCGCTCTCAGGGGGTGCGATGGTAAGGCTAGAACCTGAGGACCAGGTGTGGGTGCAGGTGGGCGTGGGT 

TGAAAACGTAGAA TATTTCCAG C TAC 

380 390 400 

780 790 800 810 820 830 840 

inputs G ATTACATTGGCATCTATGCCAGCATCAAG ACAGACAGTACCTTCTCTGGATTTCTCGTCT ATTCTGACT 

:: ::: • :: :: : ::::: :::: : 

— TTCCATAAATTGCT CC — C — CTGTGCAGACGTAACCATATCTGG TCTC — C CT 

410 420 430 440 450 

850 860 870 880 890 900 910 

inputs GGCACAGCTCCCCAGTCTTCGCTTAAAACACAGTGAACCCGGAGCTGGCACTTGCTCCTCAGTGGAGGGT 
:: : :::: ::. s: :: : • s 5 s 

GG AAGAGCTGA — AGAATT-GCATGATT GCTAGCAGTTTC ATGGT 

460 470 480 490 



920 930 940 950 960 970 980

input s GTGACACTAACCCX5CGCAGCGCATACCAGGAGGGCTGGCCCCCTGGAATATTGTGAATGACTTAGGAAGA 

: . : : ! : : : : . s i : 

CTGGA GCACC ATCATTGGCATAGGCTGA 

500 510 520 



990 1000 1010 1020 1030 1040 1050 

inpu t s GAGGGAGCC^CTTCCAGTCCCAtfTCCTGGCAATC 

:.. ts. :::::: 

TACCAAGACCTCTTCATTCTT CAN TGAGGT — TGACA 

530 540 550 

1060 1070 1080 1090 1100 1110 1120 

inputs AGCAGTGGCTGGGTTTCTGCCCAGGACTTTAGAATGCAG 

• •ststsss. • : .:tn« ..tttt t*it i«. 

TACAGTGGCACATTCACTGCC AGCTTT TACA TGTGAAAAATGA AAAA 

560 570 580 590 600 

1130 1140 1150 1160 1170 1180 1190 
inputs CTCCAAGGTGGGATGCTCCATTCCTAGTCCTGTGTCCCCTCTAGGTCCCTGACTCCATCTCTGCTGCTCC

: :::::: * * « 

c GTAGTG CCATTC ACTTGG CAAT TAAATCTAC 

610 620 630 

1200 1210 1220 1230 1240 1250 1260 

input S CAGGGCAGGCCTTTTTCTCAGAGGTCACTTAATAAACCTAAAATCCTCAAAAAAAAAAAAAAAGGGCGGC 

::: ::::: 

CAAAGCTG AGA TCAAA . 

640 650 



inputs CGC 
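
The ALIGN output above reports a global alignment score computed with a substitution matrix (PAM120) and a gap penalty pair (-12/-4). The following is a minimal sketch of how an affine-gap global alignment score can be computed. It is not the ALIGN program: it assumes a simple match/mismatch score in place of the PAM120 matrix, it uses quadratic memory rather than the linear-space method of Myers and Miller, and it assumes that a gap of length k costs gap_open + (k - 1) * gap_extend. The function name `global_alignment_score` and all parameter names are hypothetical.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap_open=-12, gap_extend=-4):
    """Best global alignment score of a and b with affine gap penalties.

    Assumption: a gap of length k costs gap_open + (k - 1) * gap_extend.
    """
    neg = float("-inf")
    n, m = len(a), len(b)
    # M: a[i-1] aligned with b[j-1]
    # X: a[i-1] aligned with a gap; Y: b[j-1] aligned with a gap
    M = [[neg] * (m + 1) for _ in range(n + 1)]
    X = [[neg] * (m + 1) for _ in range(n + 1)]
    Y = [[neg] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):
        X[i][0] = gap_open + (i - 1) * gap_extend
    for j in range(1, m + 1):
        Y[0][j] = gap_open + (j - 1) * gap_extend
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1]) + s
            X[i][j] = max(M[i - 1][j] + gap_open,
                          X[i - 1][j] + gap_extend,
                          Y[i - 1][j] + gap_open)
            Y[i][j] = max(M[i][j - 1] + gap_open,
                          Y[i][j - 1] + gap_extend,
                          X[i][j - 1] + gap_open)
    return max(M[n][m], X[n][m], Y[n][m])
```

Because every residue of both sequences must be placed, two sequences of very different length (for example 1263 versus 653 positions) necessarily accumulate many gap penalties, which is one reason a global score can be negative.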
GTCGACCCACGCGTCCGCGGACGCGTGGGT 79

MGPSTPLLILFLLSWSG 17 

AGGCTGCC ATG GGG CCC AGC ACC CCT CTC CTC ATC TTG TTC CTT TTG TCA TGG TCG GGA 138 


PLQGQQHHLVEYMERRLAAL 37 

CCC CTC CAA GGA CAG CAG CAC CAC CTT GTG GAG TAC ATG GAA CGC CGA CTA GCT GCT TTA 198 

EERLAQCQDQSSRHAAELRD 57 

GAG GAA CGG CTG GCC CAG TGC CAG GAC CAG AGT AGT CGG CAT GCT GCT GAG CTG CGG GAC 258 

FKNKMLPLLEVAEKEREALR 77

TTC AAG AAC AAG ATG CTG CCA CTG CTG GAG GTG GCA GAG AAG GAG CGG GAG GCA CTC AGA 318

TEADTISGRVDRLEREVDYL 97

ACT GAG GCC GAC ACC ATC TCC GGG AGA GTG GAT CGT CTG GAG CGG GAG GTA GAC TAT CTG 378 



ETQNPALPCVEFDEKVTGGP 117

GAG ACC CAG AAC CCA GCT CTG CCC TGT GTA GAG TTT GAT GAG AAG GTG ACT GGA GGC CCT 438



GTKGKGRRNEKYDMVTDCGY 137

GGG ACC AAA GGC AAG GGA AGA AGG AAT GAG AAG TAC GAT ATG GTG ACA GAC TGT GGC TAC 498

TISQVRSMKILKRFGGPAGL 157

ACA ATC TCT CAA GTG AGA TCA ATG AAG ATT CTG AAG CGA TTT GGT GGC CCA GCT GGT CTA 558 

WTKDPLGQTEKIYVLDGTQN 177 

TGG ACC AAG GAT CCA CTG GGG CAA ACA GAG AAG ATC TAC GTG TTA GAT GGG ACA CAG AAT 618 

DTAFVFPRLRDFTLAMAARK 197

GAC ACA GCC TTT GTC TTC CCA AGG CTG CGT GAC TTC ACC CTT GCC ATG GCT GCC CGG AAA 678

ASRVRVPFPWVGTGQLVYGG 217

GCT TCC CGA GTC CGG GTG CCC TTC CCC TGG GTA GGC ACA GGG CAG CTG GTA TAT GGT GGC 738

FLYFARRPPGRPGGGGEMEN 237

TTT CTT TAT TTT GCT CGG AGG CCT CCT GGA AGA CCT GGT GGA GGT GGT GAG ATG GAG AAC 798

TLQLIKFHLANRTVVDSSVF 257

ACT TTG CAG CTA ATC AAA TTC CAC CTG GCA AAC CGA ACA GTG GTG GAC AGC TCA GTA TTC 858

PAEGLIPPYGLTADTYIDLA 277

CCA GCA GAG GGG CTG ATC CCC CCC TAC GGC TTG ACA GCA GAC ACC TAC ATC GAC CTG GCA 918

ADEEGLWAVYATREDDRHLC 297

GCT GAT GAG GAA GGT CTT TGG GCT GTC TAT GCC ACC CGG GAG GAT GAC AGG CAC TTG TGT 978 

LAKLDPQTLDTEQQWDTPCP 317

CTG GCC AAG TTA GAT CCA CAG ACA CTG GAC ACA GAG CAG CAG TGG GAC ACA CCA TGT CCC 1038

RENAEAAFVICGTLYVVYNT 337

AGA GAG AAT GCT GAG GCT GCC TTT GTC ATC TGT GGG ACC CTC TAT GTC GTC TAT AAC ACC 1098

RPASRARIQCSFDASGTLTP 357

CGT CCT GCC AGT CGG GCC CGC ATC CAG TGC TCC TTT GAT GCC AGC GGC ACC CTG ACC CCT 1158

ERAALPYFPRRYGAHASLRY 377

GAA CGG GCA GCA CTC CCT TAT TTT CCC CGC AGA TAT GGT GCC CAT GCC AGC CTC CGC TAT 1218

NPRERQLYAWDDGYQIVYKL 397

AAC CCC CGA GAA CGC CAG CTC TAT GCC TGG GAT GAT GGC TAC CAG ATT GTC TAT AAG CTG 1278

EMRKKEEEV * 407
GAG ATG AGG AAG AAA GAG GAG GAG GTT TGA 1308

GGAGCTAGCCTTGTTTTTTGCATCTTTCTCACT 1387 

CTTCAAATGTGGGCCAGTTGTGGCTCAAATCCTCTATATTTTT 1466 

TCATACGGAACTCCAGATCCTGAGTAATCCTTTTAGAGCCCGAAGAGTCAAAACCCTCAATGTTCCCTCCTGCTCTCCT 1545

GCCCCATGTCAACAAATTTCAGGCTAAGGATGCCCCAGACCCAGGGCTC 1624 

AGGCAGCAGTGTTCTTCCCCTCAGAGTGACTTGGGGAGGGAGAAATAGGAGGAGACGTCCAGCTCTGTCCTCTCTTCCT 1703 

CACTCCTCCCTTGAGTGTCCTGAGGAACAGGACTTT 1782 
ATCCACTGCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 1831 
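
The listing above pairs each line of codons with its one-letter amino acid translation. As an illustration only, the following sketch shows how such a conceptual translation can be reproduced with the standard genetic code. The function name `translate` is hypothetical; the sketch assumes translation starts at the first base supplied, stops at the first stop codon, and writes X for any codon containing a base other than A, C, G or T.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cdna: str) -> str:
    """Translate an open reading frame into one-letter amino acid codes."""
    seq = "".join(cdna.split()).upper()  # drop the spaces between codons
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 17 codons of the open reading frame shown above
signal = translate("ATG GGG CCC AGC ACC CCT CTC CTC ATC TTG "
                   "TTC CTT TTG TCA TGG TCG GGA")
```

Applied to the first line of codons in the listing, this yields the same 17 residues printed above them, which gives a simple consistency check between a nucleotide line and its protein line.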
[Plot omitted; horizontal axis: amino acid position, with ticks at 1, 41, 81, 121, 161, 201, 241, 281, 321, 361 and 401]



MGPSTPLLILFLLSWSGPLQGQQHHLVEYMERRLAALEERLAQCQDQSSRHAAELRDFKN
KMLPLLEVAEKEREALRTEADTISGRVDRLEREVDYLETQNPALPCVEFDEKVTGGPGTK
GKGRRNEKYDMVTDCGYTISQVRSMKILKRFGGPAGLWTKDPLGQTEKIYVLDGTQNDTA
FVFPRLRDFTLAMAARKASRVRVPFPWVGTGQLVYGGFLYFARRPPGRPGGGGEMENTLQ
LIKFHLANRTVVDSSVFPAEGLIPPYGLTADTYIDLAADEEGLWAVYATREDDRHLCLAK
LDPQTLDTEQQWDTPCPRENAEAAFVICGTLYVVYNTRPASRARIQCSFDASGTLTPERA
ALPYFPRRYGAHASLRYNPRERQLYAWDDGYQIVYKLEMRKKEEEV



FIG. 10 






MGPSAPLLLLFF 12

GTCGACCCACGCGTCCGACTTAAGGCTGCC ATG GGG CCC AGT GCT CCT CTG CTG CTC CTC TTC TTT 66

LSWTGPLQGQQHHLVEYMER 32

TTG TCA TGG ACG GGA CCC CTT CAG GGA CAG CAG CAC CAC CTT GTG GAG TAC ATG GAA CGC 126

RLAALEERLAQCQDQSSRHA 52

CGA CTA GCT GCC TTA GAG GAA CGG CTG GCC CAA TGC CAG GAT CAG AGT AGT CGG CAT GCT 186

AELRDFKNKMLPLLEVAEKE 72

GCC GAG CTT CGG GAC TTC AAA AAC AAG ATG TTG CCT CTC CTG GAG GTG GCA GAG AAG GAG 246

RETLRTEADSISGRVDRLER 92

CGG GAG ACC CTC AGA ACT GAA GCA GAC TCC ATC TCA GGA AGA GTG GAC CGT CTT GAA AGG 306

EVDYLETQNPALPCVELDEK 112

GAG GTA GAC TAT CTG GAG ACA CAG AAC CCA GCT TTG CCC TGT GTA GAG CTG GAT GAG AAG 366

VTGGPGAKGKGRRNEKYDMV 132

GTG ACT GGA GGT CCT GGA GCC AAA GGC AAG GGC CGA AGA AAT GAG AAA TAC GAT ATG GTG 426

TDCSYTVAQVRSMKILKRFG 152

ACG GAC TGT AGC TAC ACA GTC GCT CAG GTG AGG TCA ATG AAG ATC CTG AAG CGG TTT GGT 486

GSVGLWTKDPLGPAEKIYVL 172

GGT TCA GTT GGC CTA TGG ACC AAG GAT CCG CTG GGG CCA GCA GAG AAG ATC TAC GTG TTA 546

DGTQNDTAFVFPRLRDFTLA 192

GAC GGC ACC CAG AAC GAC ACG GCT TTT GTC TTC CCA AGG CTG CGT GAC TTC ACC CTT GCC 606

MAARKASRIRVPFPWVGTGQ 212

ATG GCT GCC CGG AAA GCT TCC CGA ATT CGG GTG CCC TTC CCC TGG GTA GGC ACG GGG CAG 666

LVYGGFLYYARRPPGGPGGG 232

CTG GTG TAC GGT GGC TTC CTT TAT TAT GCT CGA AGG CCT CCT GGA GGA CCT GGA GGG GGT 726

GELENTLQLIKFHLANRTVV 252

GGT GAA TTG GAG AAC ACT CTG CAG CTG ATC AAA TTT CAC TTG GCA AAC CGA ACA GTG GTG 786

DSSVFPAESLIPPYGLTADT 272

GAT AGC TCA GTG TTC CCT GCA GAG AGC CTG ATA CCC CCC TAC GGC CTG ACA GCA GAT ACA 846

YIDLAADEEGLWAVYATRDD 292

TAT ATC GAC CTG GCA GCT GAT GAG GAG GGC CTG TGG GCT GTC TAT GCC ACT CGA GAT GAT 906

DRHLCLAKLDPQTLDTEQQW 312

GAC AGG CAT TTG TGT CTA GCC AAG TTA GAC CCA CAG ACA CTT GAC ACA GAG CAG CAG TGG 966

DTPCPRENAEAAFVICGTLY 332

GAC ACA CCA TGT CCC AGA GAG AAC GCA GAG GCT GCG TTT GTC ATC TGT GGG ACC CTG TAC 1026

VVYNTRPASRARIQCSFDAS 352

GTT GTC TAT AAC ACC CGC CCT GCC AGT AGG GCT CGT ATT CAG TGT TCC TTC GAT GCC AGT 1086

GTLAPERAALSYFPRRYGAH 372

GGT ACT CTC GCC CCT GAA AGG GCA GCA CTC TCC TAT TTT CCA CGC CGA TAT GGT GCC CAT 1146

ASLRYNPRERQLYAWDDGYQ 392

GCC AGC CTT CGC TAT AAC CCC CGT GAG CGC CAG CTG TAT GCC TGG GAT GAT GGC TAC CAG 1206

IVYKLEMKKKEEEV * 407

ATT GTC TAC AAA TTG GAG ATG AAG AAG AAG GAG GAG GAA GTT TAA 1251

GCAGCTAGCCTTGTGCTCTTGATTCTTATGCCCAGAC^TTTAT 1330

CGAAGGCCAGTGGTGGTAGCTCATATACCCTAATTTCTAAAGGACAACCAAATTCTCAAGCCCCTCTGTTTTATGCAGA 1409

ACTCCAGATCCTGGGTAGCATTTTAGAACT^ 1488

AGTTTAGTTCCAAACTCAGAGCCCTGTCCTTTGGAGAGGGTCAACCCCAGACAGCAGGCGACAGCATTCTTGCCCTCAG 1567

TATGACCGAAGGGAGAGAACTGAGAGACAAAGCTGCCCTCCCTCCCTTCCCCC^ 1646 

CCGCACATCACTTTGTATGGTAACAGTTTGCATTAAAAGGAAAACCCAC^^ 1721 
[Plot omitted; horizontal axis: amino acid position, with ticks at 1, 41, 81, 121, 161, 201, 241, 281, 321, 361 and 401]

>mT257 

MGPSAPLLLLFFLSWTGPLQGQQHHLVEYMERRLAALEERLAQCQDQSSRHAAELRDFKN
KMLPLLEVAEKERETLRTEADSISGRVDRLEREVDYLETQNPALPCVELDEKVTGGPGAK
GKGRRNEKYDMVTDCSYTVAQVRSMKILKRFGGSVGLWTKDPLGPAEKIYVLDGTQNDTA
FVFPRLRDFTLAMAARKASRIRVPFPWVGTGQLVYGGFLYYARRPPGGPGGGGELENTLQ
LIKFHLANRTVVDSSVFPAESLIPPYGLTADTYIDLAADEEGLWAVYATRDDDRHLCLAK
LDPQTLDTEQQWDTPCPRENAEAAFVICGTLYVVYNTRPASRARIQCSFDASGTLAPERA
ALSYFPRRYGAHASLRYNPRERQLYAWDDGYQIVYKLEMKKKEEEV
ALIGN calculates a global alignment of two sequences
version 2.0u Please cite: Myers and Miller, CABIOS (1989)

> hT257 a.a. 406 aa vs.

> mT257 a.a. 406 aa

scoring matrix: pam120.mat, gap penalties: -12/-4

94.1% identity; Global alignment score: 2097

10 20 30 40 50 60 70

inputs HGPSTPLLILFLLSWSGPLQGOOKHLVEYMERRIJU^^ 

. . • . ■ • « ■ m • • • ■••••••••••••■••••■■•••••••••••••S5!SSS:SSSSSSSSSSS*S* 

KGPSAPUJXFFI^WTGPLQGOQHHLVEYMERRIJ^^ 

10 20 30 40 50 60 70 

80 90 100 110 120 130 140

inputs KEREALRTEADTISGRVDRLEREVDYLETQNPALPCVEFDEKVTGGPGTKGKGRRNEKYDMVTDCGYTIS

JCER£TTI^TEADSISGRVDRI*EREVDY1jETQNPALPCVEI^ 

80 90 100 110 120 130 140

150 160 170 180 190 200 210

inputs QVRSMKILKRFGGPAGLOTIODPLGQTEKIYVLDGT^ 

QVRSKXJLlOlFCWSVGLWTKDPLGPAEiaYV^ 

150 160 170 180 190 200 210 

220 230 240 250 260 270 280 

inputs GQLVYGGFLYFARRPPGRPGGGGEMENTLQLIKFHLANRTVVDSSVFPAEGLIPPYGLTADTYIDLAADE

■ ■•••••«•* •••••• • ••••• •■•■i4iii;<((ii;;i;j!;i!J.2'!SJi{!!t!!!t«!a>i 

GQLVYGGFLYYARRPPGGPGGGGELENTLQLIKFHLANRTVVDSSVFPAESLIPPYGLTADTYIDLAADE
220 230 240 250 260 270 280 

290 300 310 320 330 340 350 

inputs EGLWAVYATREDDRHLCLAKLDPQTLDTEQQWDTPCPRENAEAAFVICGTLYVVYNTRPASRARIQCSFD

EGLWAVYATRDDDRHLCLAKLDPQTLDTEQQWDTPCPRENAEAAFVICGTLYVVYNTRPASRARIQCSFD
290 300 310 320 330 340 350 

360 370 380 390 400 

inputs ASGTLTPERAALPYFPRRYGAHASIJIYNPRERQLYAWDDCT 

ASGTLAPERAALSYFPRKYGAHASLRYNPRERQLYAVroDGYQ 

360 370 380 390 400 
ALIGN calculates a global alignment of two sequences
version 2.0u Please cite: Myers and Miller, CABIOS (1989)

> hT257 a.a. 406 aa vs.

> Patent Protein W75120 - (untitled) 355 aa
scoring matrix: pam120.mat, gap penalties: -12/-4

86.9% identity; Global alignment score: 1681

10 20 30 40 50 60 70 

inputs MGPSTPLLILFTJ^WSGPLQGQQHHLVEYMEIUU^ 

MGPSTPLLILFIXSWSGPI^G<X}HHLVEYMERJUJUU,EER^ 

10 20 30 40 50 60 70 

80 90 100 110 120 130 140 

inputs KEREALRTEADTISGRVDRLERET/DYLCT 

KEREAIJITEADTISGRVDRI<EREVDYLETQNPALPCVEFDEKVTGGPGTKGKGRJ^ 

80 90 100 110 120 130 140 

150 160 170 180 190 200 210 

inputs QVRSMKILIGIFGGPAGLOTKDPLGQTEKIYVIJXJT 

I::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 
OVR£MKlLKRFGGPAGLVn , KDPLCQTEKJYVLIXrrQNOT 

150 160 170 180 190 200 210

220 230 240 250 260 270 280 

inputs GQLVYGGFLYFARRPPGRPGGGGEMENTLQLIKFHLANRTVVDSSVFPAEGLIPPYGLTADTYIDLAADE

: : i :::: i J::::::::::::::::::::::::;:::::;:::::::::::::::::::: : : { ::::::: : 
GQLVYGGFLYFARRPPGRPGGGGEMENTLQLIKFHLANRTVVDSSVFPAEGLIPPYGLTADTYIDLAADE
220 230 240 250 260 270 280 

290 300 310 320 330 340 350 

inputs EGLWAVYATREDDRHLCLAKLDPQTLDTEQQWDTPCPRENAEAAFVICGTLYVVYNTRPASRARIQCSFD

EGLWAVYATREDDRHIX7t*AJCL^PQTL»DTEQQVTOTPCP 

290 300 310 320 330 340 350 

360 370 380 390 400 

inputs ASGTLTPERAALPYFPRRYGAHASLRYNPRERQLYAWDDGYQIVYKLEMRKKEEEV

: : : . 

ASGPX - 



FIG. 14 






ALIGN calculates a global alignment of two sequences

version 2.0u Please cite: Myers and Miller, CABIOS (1989)
> hT257 n.a. 1832 aa vs.

* ac0214* 1925 aa

scoring matrix: pam120.mat, gap penalties: -12/-4
93.5% identity; Global alignment score: 9158


10 20 30 40 50 60 

i npu t s GTCG ACCC ACGCGTCC GCGG ACGCGTGGG - -TG AGGGG AAG AGGCTG ACTGTACGTTCCTTCT ACTC 

♦•• • • • ... . . .» . ... ... ._>•...« ........... ........ . .. . 

... ... * . ... • ... ... ..a.. ...... .......-•.»•« .... - ..p. 

CCC-CGCCTCCAAAGCTAACCCTCGGGCTTGAGGGGAAGANGCTGACTGTACGTTCCTTCTACTC 

10 20 30 40 50 60 

70 80 90 100 110 120 130

i npu t s TGGCACCACTCTCC AGGCTGCC ATGGGGCCCAGCACCCCTCTCCTC ATCTTGTTCCTTTTGTC ATGGTCG 

TGGCACCACrCTCCAGGCTGCCATGGGGCCCACXrACCCCTCTCCTCATCT^ 

70 80 90 100 110 120 130 

140 150 160 170 180 190 200 

i npu t s GG ACCCCTCC AAGG AC AGCAGC ACC ACCTTGTGGAGTACATGGAACGCCGACTAGCTGCTTTAGAGGAAC 

GGACCCCTCCAAGGACAGCAGCACCACCTTGTGGAGTACATGGAACGCCGACTAGCTGCTTTAGAGGAAC 
140 150 160 170 180 190 200 

210 220 230 240 250 260 270 

i npu t s GGCTGGCCC AGTGCC AGGACC AGAGTAGTCGGC ATGCTGCTGAGCTGCGGGACTTCAAGAACAAGATGCT 

.......••............................•••.•........•............••.•.*. 

._« «..••«•..•«...•••••«» 

GGCTGGCCCAGTGCCAGGACCAGAGTAGTCGGCATGCTGCTGAGCTGCGGGACTTCAAGAACAAGATGCT 
210 220 230 240 250 260 270 

280 290 300 310 320 330 340 

i npu t s -GCCACTGCTGGAGGTGGCAGAGAAGGAGCGGGAGGCACTCAGAACTGAGGCCGACACCATCTCCGGGAG 

NGCCACTCXrTGGAGGTGGCAGAGAAGGAGCGGGAGGCACTCAGAACTG 

280 290 300 310 320 330 340 

350 360 370 380 390 400 410 

inputs AGTGGATCGTCTGGAGCGGGAGGTAGACTATCTGGAGACCCAGAA 

*••••■•••••••••*••*•■••••«••««••••«)••. •••••••••••••■■••«••*•••*••••■•• 

"■>••■•••••««*•••••«>•«••>••••••■•«••••■•••«•••■•••«•••••••*•••*••>«•••** 

AGTGGATCGTCIGGAGCGGGAGGTAGACTATCTGGAGAC^ 

350 360 370 380 390 400 410 

420 430 440 450 460 470 480 

inputs GATGAGAAGGTGACTGGAGGCCCTGGGACCAAAGGCAAGGGAAGAAG 

GATGAGAAGGTGACTGGAGGCCCTGGGACCAAAGGCAAGGGAAGAAGG 

420 430 440 450 460 470 480 

490 500 510 520 530 540 550
inputs CAGACTGTTCCTACACAATCTCTCAAGTGAGATC

:::::::::::::::::: s : $ ::::::::::$::::::::: : : s : : : i ::::::: : : : : : : : : : : : : : : s_ 
CAGACTGTGCCTACACAATCTCTCAAGTGAGATCAATGAAGATTCTGAAGCGATT^ 

490 500 510 520 530 540 550 



560 570 580 590 600 610 620 

i npu t s TCTATCGACCAAGG ATCCACTGGGGCAAAC AGAG AAGATCTACCTGTTAGATGGGACACAG AATG ACAC A 

::::::::::::::::: : : : : :::::::::::::::::::::::::::::::::::::::::::: : : - • 
TCTATGGACCAAGG ATCCACTGGGGCAAAC AG AG AAGATCTACGTGTT AG ATGGG AC AC AG AATG AC ACA 
560 570 580 590 600 610 620 

630 640 650 660 670 680 690 

i npu t s GCCTTTGTCTTCCC AAGGCTGCGTGACTTC ACCCTTGCCATGGCTGCCCGG AAAGCTTCCCG AGTCCGGG 

GCCTTTGTCTTCCCAAGGCTGCGTGACTTCACCCTTGCCATGGCTGCCCGGAAAGCTTCCCGAGTCCGGG 
630 640 650 660 670 680 690 

700 710 720 730 740 750 760 

i npu t s TGCCCTTCCCCTGGGTAGGCACAGGGCAGCTGGTATATGGTGGCTTTCTT^ 

TGCCCTTCCCCTGGGTAGGCACAGGGCAGCTGGTATATGGTGGCTTTCTTTATTTTGCTCGGAGGCCTCC 
700 710 720 730 740 750 760 

770 780 790 800 810 820 830 

inputs TGGAAGACCTGGTGGAGGTGGTGAGATGGAGAACACTTTGCAGCTAATCA 

TGGAAGACCTGGTGGAGGTGGTGAGATGGAGAACACTTTGCAGCTAATCAAA 

770 780 790 800 810 820 830 

840 850 860 870 880 890 900 

inputs ACAGTGGTGGACAGCTCAGTATTCCCAGCAGAGGGGC^ 

:::::::::::::::::::::::::::::::::::::::::::::: :s s - ' • : * : : : " : : : : : : : : : : : : 
. ACAGTGGTGGACAGCTCAGTATTCCCAGCAGAGGGGCTGATCCCCCCCTACGGCTTGACAGCAGACACCT 

840 850 860 870 880 890 900 

910 920 930 940 950 960 970

inputs AC^TCGACCTGGCAGCTG ATGAGGAAGGTCTTTGGGCTGTCTATGC 

ACATCGACCTGGCAGCTGATGAGGAAGGTCTTTGGGCTGTCT 

910 920 930 940 950 960 970 

980 990 1000 1010 1020 1030 1040 

inputs GTGTCTGGCCAAGTTAGATCCACAGACACTGGACACAGAGCAGCA 

GTGTCTGGCCAAGTTAGATCXACAGACACTGGAC^ 

980 990 1000 1010 1020 1030 1040 

1050 1060 1070 1080 1090 1100 1110 

inputs AATGCTGAGGCTGCCITTGTCATCTGTGGGAC 

:::::::: : z : :::::::: : : : : : : : : s : : : : : ? ' = • s : : : • : : s : s : 
AATCXrrcAGGCTCCCTT^^

1050 1060 1070 1080 1090 1100 1110

1120 1130 1140 1150 1160 1170 1180 

inputs CCCGCATCCAGTGCTCCTTTGATGCCAGCGGCACCCTGACCCCTGAACGG 

CCCGCATCCAGTGCTCCTTTGATGCCAGCGG-ACCCTGACCCCTGAACGG<^ 

1120 1130 1140 1150 1160 1170 1180

1190 1200 1210 1220 1230 1240 1250 

inpu t s CCCCAGATATGGTCCCC ATGCC AGCCTCCGCTATAACCCCCG AGAACGCCAGCTCTATGCCTCGC ATGAT 

CCGCAGATATGGTGCCCATGCCAGCCTCCGCTATAACCCCCGAGAACGCCAGCTCTATGCCTGGGATGAT 
1190 1200 1210 1220 1230 1240 1250 

1260 1270 1280 1290 1300 1310 1320 

inpu t s GGCTACCAGATTGTCTATAAGCTGGAGATG AGGAAGAAAGAGGAGG AGGTTTGAGGAGCTAGCCTTGTTT 

CGCTACCAGATTGTCTATAAGCTGGAGATGAGGAAGAAAGAGGAGGAGGTTTGAGGAGCTAGCCTTGTTT 
1260 1270 1280 1290 1300 1310 1320 

1330 1340 1350 1360 1370 1380 1390 

i npu t s TTTGCATCTTTCTCACTCCCATACATTTATATTATA 

TTTGCATCTTTCTCACTCCCATACATTTATATTATATCCCCACT 

1330 1340 1350 1360 1370 1380 1390

1400 1410 1420 1430 1440 1450 1460 

inputs TGTGGGCCAGTTGTGGCTCAAATCCTCTATATTTTTAG 

TGTGGGCCAGTTGTGGCTCAAATCCTCTATATTTTO 

1400 1410 1420 1430 1440 1450 1460 

1470 1480 1490 1500 1510 1520 1530 

inputs TTTCATACGGAACTCCAGATCCTGAGTAATCCTTTTAGAGCCCGAAG 

TTTCATACGGAACTCCAGATCCTGACTAATCCTTTTAGAGCCCG 

1470 1480 1490 1500 1510 1520 1530 

1540 1550 1560 1570 1580 1590 1600 

inputs CCTGCTCTCCTGCCCCATGTCAAOU^ 



CCTGCTCTCCTCCCCCATGTGAACAAATTTCAGGCT 

1540 1550 1560 1570 1580 1590 1600

1610 1620 1630 1640 1650 1660 1670

inputs ATGCGGGCAGGCCCAGGGAGCAGGCAGCAGTGTTCTTCCCCTC^ 



ATGCGGGCAGGCCCAGGGAGCAGGCAGCAGTGTTCTTCCCCTpAGAGTG 

1610 1620 1630 1640 1650 1660 1670

1680 1690 1700 1710 1720 1730 1740

inputs AGGAGACGTCCAGCTCTGTCCTCTCTTCC^ACTCCTCCCTTCAGTGTCCTGAGGAACAGGACTTTCTCC 

AGGAGACGTCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCTTCAGTGTCCTCAGGAACA 

1680 1690 1700 1710 1720 1730 1740

1750 1760 1770 1780 1790 1800 1810 

i npu t S AC ATTGTTTTGTATTGC AAC ATTTTGCATTAAAAGG AAAATCC ACTGCT AAAAAAAAAAAAAAAAAAAAA 

ACATTGTTTTGTATTGCAACATTTTGCATTAAAAGGAAAATCCANAAAAAAAA 

1750 1760 1770 1780 1790 1800 1810 

1820 1830 
inputs AAAAAAAAGG GCGGCCGC 



AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

1820 1830 1840 1850 1860 1870 1880 



inputs 



TCGTCTTCTCGCAGCCGTACCCrrTCTGTCGTCTTCTCGCAGCC 
1890 1900 1910 1920 
ALIGN calculates a global alignment of two sequences
version 2.0u Please cite: Myers and Miller, CABIOS (1989)

> mT257 a.a. 406 aa vs.

> Patent Protein W75120 - (untitled) 355 aa
scoring matrix: pam120.mat, gap penalties: -12/-4

81.8% identity; Global alignment score: 1599

10 20 30 40 50 60 70 

inputs MGPSAPLLLLFFLSWTGPLQGQQHHLVEYMERRLAALEERLAQCQDQSSRHAAELRDFKNKMLPLLEVAE

::::.:::.::.:::.::::::::::::::::::::::::::::::::::::::::::::: : : : : : : : : : 
MGPSTPLLILFLLSWSGPLQGQQlfflLVEYMERRLAAL^ 

10 20 30 40 50 60 70 

80 90 100 110 120 130 140 

inputs KERETLRTEADSISGRVDRLEREXTOYLETQNPALPCVX 

::•••*::•::. ::*■•:::«■: a:::::*:::::::*.:::::.... «■»■«■•••■••••■••••••• 

KEREALRTEADTISGRVDRLEREVDYLETQNPALPCVEFDEKVTGGPGTKGKGRRNEKYDMVTDCGYTIS
80 90 100 110 120 130 140 

150 160 170 180 190 200 210 

inputs QVRSMKILKRFGGSVGLWTKDPLGPAEKI YVTjDGTQNDT^ 

QVRSMKILKRFGGPAGLWTKDPLGQTEKIYVLDGTQOTT^ 

150 160 170 180 190 200 210 

220 230 240 250 260 270 280 

inputs GQLVYGGFLYYARRPPGGPGGGGELENTLQLIKFHLANRTVVDSSVFPAESLIPPYGLTADTYIDLAADE
■«.•»•■•■• •••••• ■•••.••■•■•••..■•«••....* "■•■iiiitjiis:::::: 

GQLVYGGFLYFARRPPGRPGGGGEMENTLQLIKFHLANRTVVDSSVFPAEGLIPPYGLTADTYIDLAADE
220 230 240 250 260 270 280 

290 300 310 320 330 340 350 

inputs EGLWAVYATRDDDRHLCLAKLDPQTLDTEQQWDTPCPRENAEAAFVICGTLYVVYNTRPASRARIQCSFD

■ • • ■ ... .... ............ • • •■«»• 

EGLWAVYATREDDRHLCLAKLDPQTLDTEQQWDTPCPRENAEAAFVICGTLYVVYNTRPASRARIQCSFD
290 300 310 320 330 340 350 

360 370 380 390 400 

inputs ASGTLAPERAALSYFPRRYGAHASLRYNPRERQLYAWDDGYQIVYKLEMKKKEEEV

ASGPX - - 
ALIGN calculates a global alignment of two sequences
version 2.0u Please cite: Myers and Miller, CABIOS (1989)

> mT257 n.a. 1721 aa vs.

> Patent Nucleotide V34217 - (untitled) 1690 aa
scoring matrix: pam120.mat, gap penalties: -12/-4

76.2% identity; Global alignment score: 6493

10 20 
inputs GT CG ACCCAC GCGTCC - GACTTAAGG - 



GAGCAGGAGAGAAGGCACCGCCCCACCCCGCCTCCAAAGCTAACCCTCGGGCTTGAGGGGAAGAGGCTGA 
10 20 30 40 50 60 70 

30 40 50 

inputs CTGCCATGGGGCCCAGTGCTCCTCTGCTGCTCCT



CTGTACGTTCCTTCTACTCTGGCACCACTCTCCAGGCTGCCATGGGGCCCAGCACCCCTCTCCTCATCTT 
80 90 100 110 120 130 140 

60 70 80 90 100 110 120 

i npu t S CTTCTTTTTGTCATGGACGGGACCCCTTCAGGGACAGCAGCACCACCTTGTGGAGTACATGGAACGCCGA 

GTTCCT1TTGTCATGGTCGGGACCCCTCCAAGGACAGCAGCACCACCTTGTGGAGTACATGGAACGCCGA 
150 160 170 180 190 200 210 

130 140 150 160 170 180 190

inputs CTAGCTGCCTTAGAGGAACGGCTGGCCCAATGCCAGGATCAGAGTAGTCGGCATGCTGCCGAGCTTCGGG 

CTAGCTGCTTTAGAGGAACGGCTGGCCCAGTGCCAGGACCAGAGTAGTCGGCATGCTGCTGAGCTGCGGG 
220 230 240 250 260 270 280 

200 210 220 230 240 250 260 

i npu t s ACTTCAAAAACAAGATGTTGCCTCTCCTGGAGGTGGCAGAGAAGGAGCGGGAGACCCTCAGAACTGAAGC 

ACTTCAAGAACAAGATGCTGCCACTGCTGGAGGTGGCAGAGAAGGAGCGGGAGGCACTCAGAACTGAGGC 
290 300 310 320 330 340 350 

270 280 290 300 310 320 330

inputs AGACTCCATCTCAGGAAGAGTGGACCGTCTTGAAAGGGAGGTAGACTATCTGGAGACACAGAACCCAGCT

CX3ACACCATCTCCGGGAGAGTGGATCGTCTGGAGCGGGAGGTAGACTATCTGGAGACCCAGAACCCAGCT 
360 370 380 390 400 410 420 

340 350 360 370 380 390 400 

Lnpu C S TTGCCCTCTGTAGAGCTGGATGAGAAGGTGACTGGAGGTCCTGGAGCCAAAGGCAAGGGCCGAA 

CTGCCCTGTGTAGAGTTTGATGAGAAGGTGACTt^AGGCCCTGGGACCAAAGGCAA 

430 440 450 460 470 480 490

410 420 430 440 450 460 470 

nputs AGAAATACGATATGGTGACGGACTGTAGCTACACAGTCGCTCAGGTGAGGTCAATC 

22Z2_2S2?S*"9!***«* • ••••••• mm •••• • m m m m ■•*•••«•••« 

AGAAGTACGATATGGTCACAGACTGTGGCTACACAATCrCTCAAGTGAGATCAATC 

500 510 520 530 540 550 560

480 490 500 510 520 530 540

nputs GTTTGGTGGTTCAGTTGGCCTATGGACCAAGGATXXX3CTGGG 

::x s:i :: j :::::::::: : j : 
ATTTGGTGGCOCAGCTCGTCTATGGACCAAGGATC^ 

570 580 590 600 610 620 630 

550 560 570 580 590 600 610

nputs OQCACCCAQAACX3ACAOQQCTTTTQTCTTCCCAAQGCTOOQ 

tt n inn ttnt.ti irsxssisstiisiissttsssiiittsiiisssssiss'ssstttstts 

640 650 660 670 680 690 700

620 630 640 650 660 670 680

iputs AAOCTTTCCQAATTCCKKrrtKXXTrCCCCT 

tMlttMttt.l tt u 1 1 : tt 1 1 : i u 1 4 1 1 1 1 1 1 1 | t , | I I 1 1 it m t : , u itssts'ss stt st 
AAQCTTCCCOAOTCCOOOTOCCCTTCCCCTt^

710 720 730 740 750 760 770

690 700 710 720 730 740 750

inputs TTATGCTCGAAGGCCTCCT^^ 

2 i i 5 : 2 : 2 2 2 .:: 2 2 2 2 1 2 2 : : 2: |! : : 2 : 2 *- 

TTTTG CTCGG AGGCCTCCTGO AAG ACCTGGTGG AGOTGOTO AO ATGGAGAACACTTTGCA^ CTAATCAAA 
780 790 800 810 820 830 840 

760 770 780 790 800 810 820 

i npu t s TTTCACTTGGCAAACCGAACAGTGGTGG 

TTCCACCTCGCA^ 

850 860 870 880 890 900 910 

830 840 850 860 870 880 890 

i npu t s GCCTG AC AGCAGATACATATATCG ACCTGGCAGCTG ATG AGGAGGGCCTGTGGGCTGTCTATGCCACTCG 

GCTTGACAGCAGACACCTACATC 

920 930 940 950 960 970 980 

900 910 920 930 940 950 960 

i npu t s AGATGATGACAGGCATTTGTGTCTAGCCAAGTTAG ACCCACAGACACTTGACACAGAGCAGCAGTGGG AC 

GGAGGATGACAGGCACTTX3 

990 1000 1010 1020 1030 1040 1050 

970 980 990 1000 1010 1020 1030 

inputs ACACCATGTCCC AGAGAG AACGCAG AGGCTGCGTTTGTC ATCTGTGGG ACCCTGTACGTTGTCTATAACA 

ACACCATGTCCCAGAGAG 

1060 1070 1080 1090 1100 1110 1120

1040 1050 1060 1070 1080 1090 1100 

i npu t s CCCGCCCTGCCAGTAGGGCTCGTATTCAGTGTTCCTTCGATGCCAGTGGTACTCTCGCCCCTG AAAGGGC 

CCCGTCCTCCCAGTCGOTCCCGC ATCCAGTGCTCCTTTGATGCC AGCGG - ACCCTGACCCCTGAACGGGC 
1130 1140 1150 1160 1170 1180 

1110 1120 1130 1140 1150 1160 1170

inputs AGCACTCTCCTATTTTCCACX3CCGATATGGTGCCCATGCCAGCCrT(X3 

AGOOTCCOTOT 

1190 1200 1210 1220 1230 1240 1250 

1180 1190 1200 1210 1220 1230 1240 

inputs CTGTATGCCTGGGATGATGGCTACCAGATTGTCTACAAAT 

CTCTATCCCIX^ATC^ 
1260 1270 1280 1290 1300 1310 1320 

1250 1260 1270 1280 1290 1300

inputs AAGCAGCTAGCCTTGTGCT- - - CTTG ATTCTTATGCCCAGACATTTATATT CCTGTGAGCTCTCC 

. : : :::::::::: t : : 2.2 .:22s : : : : 2 : s : : : : : 2 : : 2 : 2 . 2 - 1 » 2 

GAGGAGCTAGGCTTGTTTTTTGCATCTTTCTCACT 
1330 1340 1350 1360 1370 1380 1390 

1320 1330 1340 1350 1360 1370 

inputs TGCAGTTC^TCCTTCAAAACGAAGGCCAGTGGTGGT AGGACAA 

22 , :::: t:::::t. 2 22 a a 22::.:: 2 22 i** a 2. 2 5 ''III 

TOTTCCTCATTCTTCAAAT-QTGGG<X!AGTTGTGG CTCAAATCCTCTATATTTTTAGCCAATGGCAA 

1400 1410 1420 1430 1440 1450 1460

1380 1390 1400 1410 1420 1430 1440

inputs CCAAATTCTCA-AGCCCCTCTGTTTTATGCAOAACTC -CATTTTAGAACTGAACAG 
tttitttt • m tn itttt si. t,iisstt tsttt ttsi.tss. t nn ut.t .« «2 
TCAA Ari C'fT T C A O C ' ltXTI ' lUl riCATACQQAACTCCAQATCCTQAQTAATCCTTTTAQAQCCCQA^AG 
1470 1480 1490 1500 1510 1520 1530

1450 1460 1470 1480 1490
inputs CAAACAAACACCCTAAAT- CTTCACTCCTGCCTTATGTCCACAAAGTT -TAGTT CC

AGTCAAAACCCTCAATGTTCCCTCCTGCTCTCCTGCrc 

1540 1550 1560 1570 1580 1590 1600 

1500 1510 1520 1530 1540 1550 1560

i npu t s AAACTCAGAGCCCTGTCCTTTGG AGAGGGTCAACCCCAGACAGCAGGCG ACAGCATTCTTGCCCTCAGTA 

• ■ ... • • • « .... . ... ..... ....... . I I ! I ! 

.... ...... •.*..... ... ••* . * • ...... ..»••...••.. ••••«• 

AGACCCAGGGCTCTAACCTTGTATGCGGG-CAGGCCCAGGGAGCAGGCAGCAGTGTTCTTCCCCTCAGAG 
1610 1620 1630 1640 1650 1660 1670 

1570 1580 1590 1600 1610 1620 

i npu t s TG ACC - GAAGGG AG AG AACTC AG AG A - CAAAGCTGCCCTC CCTCCCTTCCCCCTCC AGTG 

:::: : ::. ::: :::: :::: ' : : :::: ::::: 

TGACTTGGGGAGGGAGAAATAGGAGGAGACGTCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCTTCAGTG 
1680 1690 1700 1710 1720 1730 1740 



1630 1640 1650 1660 1670 1680 1690 

inputs TAGGGGAGAATGGGGCTTTCCCC AC ATC ACTTTGTATGGTAACAGTTTGCATTAAAAGGAAAACCCAC - - 
• • • » • ..... ...... ....... i •••• i » . i • • • • # • ; t j j i • • j jrii 

. ••«•*• ......... ...... • *■••••* * .... ...••.»•»......••• 

TCCTGAGGAACAGGACTTTCTCCACATTGTTTTGTATTGCAACATTTTGCATTAAAAGGAAAATCCACTG 
1750 1760 1770 1780 1790 1800 1810 

1700 1710 1720 
inputs CAAAAAAAAAAAAAAAGGG CGGC - CG 



CAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACGGCACGAGGGGGGGTCCCGTACCCAATNGCCCTCA 
1820 1830 1840 1850 1860 1870 1880 



inputs C- 



CATGCAT 
1890 
GTCGACCCACGCGTNCNTCCAGCGTNCGGAGCCGCCCTGGGTGTCAGCGGCTCGGCTCCCGCGCACGCTCCGGCCGTCG 79

M 1

CGCAGCCTCGGCACCTGCAGGTCCGTGCGTCCCGCGGCTGGCGCCCCTGACTCCGTCCCGGCCAGGGAGGGCC ATG 155

ISLPGPLVTNLXRFLFLGLS 21 

ATT TCC CTC CCG GGG CCC CTG GTG ACC AAC TTG NTG CGG TTT TTG TTC CTG GGG CTG AGT 215 

ALAPPSRAQLQLHLPANRLQ 41

GCC CTC GCG CCC CCC TCG CGG GCC CAG CTG CAA CTG CAC TTG CCC GCC AAC CGG TTG CAG 275

AVEEGESGASAWYTLHREVS 61

GCG GTG GAG GAG GGG GAA AGT GGT GCT TCA GCA TGG TAC ACC TTG CAC AGG GAG GTG TCT 335 

SSQPWEVPFVMWFFKQKEKE 81

TCA TCC CAG CCA TGG GAG GTG CCC TTT GTG ATG TGG TTC TTC AAA CAG AAA GAA AAG GAG 395 

DQVLSYINGVTTSKPGVSLV 101

GAT CAG GTG TTG TCC TAC ATC AAT GGG GTC ACA ACA AGC AAA CCT GGA GTA TCC TTG GTC 455

YSMPSRNLSLRVEGLQEKDS 121

TAC TCC ATG CCC TCC CGG AAC CTG TCC CTG CGG GTG GAG GGT CTC CAG GAG AAA GAC TCT 515 

GPYSCSVNVQDKQGKSRGHS 141

GGC CCC TAC AGC TGC TCC GTG AAT GTG CAA GAC AAA CAA GGC AAA TCT AGG GGC CAC AGC 575

IKTLELNVLVPPAPPSCRLQ 161 

ATC AAA ACC TTA GAA CTC AAT GTA CTG GTT CCT CCA GCT CCT CCA TCC TGC CGT CTC CAG 635 

GVPHVGANVTLSCQSPRSKP 181

GGT GTG CCC CAT GTG GGG GCA AAC GTG ACC CTG AGC TGC CAG TCT CCA AGG AGT AAG CCC 695 

AVQYQWDRQLPSFQTFFAPA 201

GCT GTC CAA TAC CAG TGG GAT CGG CAG CTT CCA TCC TTC CAG ACT TTC TTT GCA CCA GCA 755 

LDVIRGSLSLTNLSSSMAGV 221 

TTA GAT GTC ATC CGT GGG TCT TTA AGC CTC ACC AAC CTT TCG TCT TCC ATG GCT GGA GTC 815 

YVCKAHNEVGTAQCNVTLEV 241

TAT GTC TGC AAG GCC CAC AAT GAG GTG GGC ACT GCC CAA TGT AAT GTG ACG CTG GAA GTG 875

STGPGAAVVAEAVVGTLVGL 261

AGC ACA GGG CCT GGA GCT GCA GTG GTT GCT GAA GCT GTT GTG GGT ACC CTG GTT GGA CTG 935

GLLAGLVLLYHRRGKALEEP 281

GGG TTG CTG GCT GGG CTG GTC CTC TTG TAC CAC CGC CGG GGC AAG GCC CTG GAG GAG CCA 995 

ANDIKEDAIAPRTLPWPKSS 301

GCC AAT GAT ATC AAG GAG GAT GCC ATT GCT CCC CGG ACC CTG CCC TGG CCC AAG AGC TCA 1055

DTISKNGTLSSVTSARALRP 321

GAC ACA ATC TCC AAG AAT GGG ACC CTT TCC TCT GTC ACC TCC GCA CGA GCC CTC CGG CCA 1115

PHGPPRPGALTPTPSLSSQA 341

CCC CAT GGC CCT CCC AGG CCT GGT GCA TTG ACC CCC ACG CCC AGT CTA TCC AGC CAG GCC 1175

LPSPRHAHDRWGPPSTNIPH 361

CTG CCC TCA CCA AGA CAT GCC CAC GAC AGA TGG GGC CCA CCC TCA ACC AAT ATC CCC CAT 1235

PWWGFFLWL * 371
CCC TGG TGG GGT TTT TTC CTT TGG CTT TGA 1265 

GCCGCATGGGTGCTGNGCCTGTGATGGNGCCTGCCC^GAGTCAAGCTGGCTCTCTGGTATGATGACCCCACCACTCATT 1344

GGCTAAAGGATTTGGGGTCTCTCCTTCCTATAAGGGTCACCTCTAGCACAGAGGCCTGA 1423

TCCTGACCCTTAGTACTCTGCCCCCACCTCTCITTACTGTGGGAAAACCA 1502 

AGAAGG AGAAGAGGAAGTGGATCTGGAATTGGGAGGAGCCTCCACC(^CCCCTGACTCCTCCTTATGAAGC 1581 

3AAATTAGCTACTCACCAAGAGTGAGGGGCAGAGACTTCCAGTCACTGAGTCTCCCAGGCCCCCTTGATCTGTACCCCA 1660

:CCCTATCTAACACCACCCTTGGCTCCCACTCCAG 173 9 

rACTGGGGCAGAGGATAGGGAATCTCTTATTAAAACTAACATGAAATATGTGTTGTTTTCATT 1818 
G ATACATAATGTTTGTATGAGATAAGAAAAAAAAAAAAAAAGGGCGGCCGC 1869 
[Plot omitted; horizontal axis: amino acid position, with ticks at 1, 41, 81, 121, 161, 201, 241, 281, 321 and 361]



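MISLPGPLVTNLXRFLFLGLSALAPPSRAQLQLHLPANRLQAVEEGESGASAWYTLHREV
SSSQPWEVPFVMWFFKQKEKEDQVLSYINGVTTSKPGVSLVYSMPSRNLSLRVEGLQEKD
SGPYSCSVNVQDKQGKSRGHSIKTLELNVLVPPAPPSCRLQGVPHVGANVTLSCQSPRSK
PAVQYQWDRQLPSFQTFFAPALDVIRGSLSLTNLSSSMAGVYVCKAHNEVGTAQCNVTLE
VSTGPGAAVVAEAVVGTLVGLGLLAGLVLLYHRRGKALEEPANDIKEDAIAPRTLPWPKS
SDTISKNGTLSSVTSARALRPPHGPPRPGALTPTPSLSSQALPSPRHAHDRWGPPSTNIP
HPWWGFFLWL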
GTCGACCCACGCGTCCGGTGCACA^ 79

MILQAGTPETSLL 13

CCTGAGTACTCCGGGCCAAGGAGGGCC ATG ATT CTT CAG GCT GGA ACC CCC GAG ACC AGC TTG CTG 145

RVLFLGLSTLAAFSRAQMEL 33

CGG GTT TTG TTC CTG GGA CTG AGT ACC CTT GCT GCC TTC TCC CGA GCT CAG ATG GAG TTG 205

HVPPGLNKLEAVEGEEVVLP 53

CAC GTG CCC CCG GGC CTC AAC AAA TTG GAA GCG GTA GAG GGA GAA GAA GTG GTG CTC CCC 265

AWYTMAREESWSHPREVPIL 73

GCC TGG TAC ACG ATG GCA CGG GAG GAG TCG TGG TCC CAC CCC CGG GAG GTG CCC ATC CTG 325 

IWFLEQEGKEPNQVLSYING 93

ATC TGG TTC TTG GAA CAA GAA GGG AAG GAA CCA AAC CAG GTG TTG TCT TAC ATT AAT GGA 385

VMTNKPGTALVHSISSRNVS 113

GTC ATG ACA AAT AAA CCT GGA ACA GCC CTG GTC CAC TCT ATC TCT TCA CGG AAT GTG TCC 445

LRLGALQEGDSGTYRCSVNV 133

CTG CGC CTG GGG GCA CTC CAG GAG GGA GAC TCT GGG ACT TAC CGC TGT TCT GTC AAT GTG 505

QNDEGKSIGHSIKSIELKVL 153

CAG AAT GAT GAA GGC AAA AGT ATA GGC CAC AGC ATC AAA AGC ATA GAG CTC AAA GTG CTG 565

VPPAPPSCSLQGVPYVGTNV 173 

GTT CCT CCA GCT CCT CCA TCC TGT AGT TTA CAG GGT GTA CCC TAT GTC GGG ACC AAT GTG 625 

TLNCKSPRS KPTAQYQWERL 193 

ACC CTG AAC TGC AAG TCC CCA AGG AGT AAA CCT ACT GCT CAG TAC CAG TGG GAG AGG CTG 685 

APSSQVFFG PA LDAV RGS LK 213 

GCC CCA TCC TCC CAG GTC TTC TTT GGA CCA GCC TTA GAT GCT GTT CGT GGA TCT TTA AAG 745 

LTNLSIAMS GVYVC KAQNRV 233 

CTC ACT AAC CTT TCC ATT GCC ATG TCT GGA GTC TAT GTC TGC AAG GCT CAA AAC AGA GTG 805 

GFAKCNVTLDVMTG S K A A V V 253 

GGC TTT GCC AAG TGC AAC GTG ACC TTG GAC GTG ATG ACA GGG TCC AAG GCT GCA GTG GTC 865

AGAVVGTFVGLVLIAGLVLL 273

GCT GGA GCA GTT GTG GGC ACT TTT GTT GGG TTG GTG CTG ATA GCT GGG CTG GTC CTG TTG 925

YQRRSKTLEELANDIKEDAI 293

TAC CAG CGC CGG AGC AAG ACC TTG GAA GAG CTG GCC AAT GAT ATC AAG GAA GAT GCC ATT 985

APRTLPWTKGSDTISKNGTL 313

GCT CCC CGG ACC TTG CCT TGG ACC AAA GGC TCA GAC ACA ATC TCC AAG AAT GGG ACA CTT 1045
SSVTSARALRPPKAAP PRPG 333

TCT TCG GTC ACC TCA GCA CGA GCT CTG CGG CCA CCC AAG GCT GCT CCT CCA AGA CCT GGC 1105

TFTPTPSVSS QAL SSPRLPR 353

ACA TTT ACT CCC ACA CCC AGT GTC TCT AGC CAG GCC CTG TCC TCA CCA AGA CTG CCC AGG 1165 

VDEPPPQAVSLTPGGVSSSA 373 

GTA GAT GAA CCC CCA CCT CAG GCA GTG TCC CTG ACC CCA GGT GGG GTT TCT TCT TCT GCT 1225

LSRMGAV P VMVPAQSQAGSL 393 

CTG AGC CGC ATG GGT GCT GTG CCT GTG ATG GTG CCT GCA CAG AGT CAG GCT GGG TCT CTT 1285

V 395 

GTG TGA 1291 

TAGCCCAGGCACTCATTAGCTACATCTGGTATCTGACCTTTCTGTAAAGGTCTCCTTGTGGCACAGAGGACTCAATCTT 1370

GGGAGGATGCCCACATTCTAGACCTCCAGTCCTTTGCTCCTACCTCCTTCrrATTC 1449

CTAAAATCTGGGTCAAAGGACAAAAGGAGGAAATGGACCTGAGGTAGGGGGTTGGGAGTGAG 1528

CTGCTTCTCCCTGAAGCCAGATGAATGCTGCGGAAGATCGGCTACCCTCCAAGGGCTCTGGAGGAGACTGCC^ 1607

GATGCCCCTGGCTCTGTGATCTGTACAACACCCTTATCTAATGCTGTCCTTTGCCGTTCG 1686

ATAACCTGTCCTGCTGGCTTGGCTGGGTT^ 1765

GATGTTTTTGTTTTTATTTTGCAAATTTC^ 1844 

1846 
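
The listing above pairs each codon with a one-letter amino acid code. As a minimal sketch, assuming the standard genetic code, the protein line can be regenerated from the codon line, which is one way to cross-check residues in a damaged copy. The names `CODON_TABLE` and `translate` are illustrative and do not come from the source.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an open reading frame, stopping at the first stop codon.

    Whitespace is ignored; a codon containing an ambiguous base such as N
    is reported as X.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First coding line of the listing above (codons only, without the 5' UTR).
print(translate("ATG ATT CTT CAG GCT GGA ACC CCC GAG ACC AGC TTG CTG"))
```

Running the same function over a later codon line, for example `CAG AAT GAT GAA`, gives the residues that should appear above it.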
[Plot not reproduced. Horizontal axis ticks: 1, 41, 81, 121, 161, 201, 241, 281, 321, 361.]

>mT258

MILQAGTPETSLLRVLFLGLSTLAAFSRAQMELHVPPGLNKLEAVEGEEVVLPAWYTMAR
EESWSHPREVPILIWFLEQEGKEPNQVLSYINGVMTNKPGTALVHSISSRNVSLRLGALQ
EGDSGTYRCSVNVQNDEGKSIGHSIKSIELKVLVPPAPPSCSLQGVPYVGTNVTLNCKSP
RSKPTAQYQWERLAPSSQVFFGPALDAVRGSLKLTNLSIAMSGVYVCKAQNRVGFAKCNV
TLDVMTGSKAAVVAGAVVGTFVGLVLIAGLVLLYQRRSKTLEELANDIKEDAIAPRTLPW
TKGSDTISKNGTLSSVTSARALRPPKAAPPRPGTFTPTPSVSSQALSSPRLPRVDEPPPQ
AVSLTPGGVSSSALSRMGAVPVMVPAQSQAGSLV
ALIGN calculates a global alignment of two sequences
version 2.0uPlease cite: Myers and Miller, CABIOS (1989)

> hT258 a.a. 370 aa vs.

> mT258 a.a. 394 aa

scoring matrix: pam120.mat, gap penalties: -12/-4

62.8% identity; Global alignment score: 1085

10 20 30 40 50 60

inputs MISLPGPLVTNLXRFLFLGLSALAPPSRAQLQLHLPA--NRLQAVEEGESGASAWYTLHREVSSSQPWEV

. . :.:.::: : .::::. :: » :.:•'■: 

MILQAGTPETSLLRVLFLGLSTLAAFSRAQMELHVPPGLNKLEAVEGEEVVLPAWYTMAREESWSHPREV
10 20 30 40 50 60 70

70 80 90 100 110 120 130 

inputs PFVMWFFKQKEKE-DQVLSYINGVTTSKPGVSLVYSMPSRNLSLRVEGLQEKDSGPYSCSVNVQDKQGKS

PILIWFLEQEGKEPNQVLSYINGVMTNKPGTALVHSISSRNVSLRLGALQEGDSGTYRCSVNVQNDEGKS
80 90 100 110 120 130 140

140 150 160 170 180 190 200

inputs RGHSIKTLELNVLVPPAPPSCRLQGVPHVGANVTLSCQSPRSKPAVQYQWDRQLPSFQTFFAPALDVIRG

:::::..::.:::::::::: : : : : : : : . : : s :.:.::::::..: i * * • * 11 *"****"" 
IGHSIKSIELKVLVPPAPPSCSLQGVPYVGTNVTLNCKSPRSKPTAQYQWERLAPSSQVFFGPALDAVRG
150 160 170 180 190 200 210

210 220 230 240 250 260 270

inputs SLSLTNLSSSMAGVYVCKAHNEVGTAQCNVTLEVSTGPGAAVVAEAVVGTLVGLGLLAGLVLLYHRRGKA
• : : : : : : .:.:::::::.: : : :.:::::.: : : . : : : : : :::::-: * : : " 1 S ' ! * * S * ' * " 
SLKLTNLSIAMSGVYVCKAQNRVGFAKCNVTLDVMTGSKAAVVAGAVVGTFVGLVLIAGLVLLYQRRSKT
220 230 240 250 260 270 280

280 290 300 310 320 330 340 

inputs LEEPANDIKEDAIAPRTLPWPKSSDTISKNGTLSSVTSARALRPPHG-PPRPGALTPTPSLSSQALPSPR

LEELANDIKEDAIAPRTLPWTKGSDTISKNGTLSSVTSARALRPPKAAPPRPGTFTPTPSVSSQALSSPR

290 300 310 320 330 340 350

350 360 370

inputs HAH - DRWGPPSTNIPHPWWGFFLWL 

LPRVDEPPPQAVSLTPGGVSSSALSRMGAVPVMVPAQSQAGSL-V
360 370 380 390 
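
The ALIGN header above reports a percent identity for the global alignment. As a rough sketch of how such a figure can be derived from two gapped rows, the function below counts identical columns and reports two possible denominators. Which denominator ALIGN itself uses is not stated in this text, so neither ratio should be read as reproducing the reported 62.8%. The name `identity_stats` is illustrative, and the example rows are a short stretch of the human/mouse alignment above.

```python
def identity_stats(row_a, row_b, gap="-"):
    """Return (identical, ungapped_columns, total_columns) for two aligned rows."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same number of columns")
    identical = 0
    ungapped = 0
    for a, b in zip(row_a, row_b):
        if a == gap or b == gap:
            continue  # a column with a gap in either row cannot be an identity
        ungapped += 1
        if a == b:
            identical += 1
    return identical, ungapped, len(row_a)


# Short stretch of the human (top) and mouse (bottom) rows.
human = "RAQLQLHLPA--NRLQAVE"
mouse = "RAQMELHVPPGLNKLEAVE"

identical, ungapped, columns = identity_stats(human, mouse)
print(f"identical columns: {identical}")
print(f"identity over ungapped columns: {100 * identical / ungapped:.1f}%")
print(f"identity over all columns: {100 * identical / columns:.1f}%")
```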
ALIGN calculates a global alignment of two sequences
version 2.0uPlease cite: Myers and Miller, CABIOS (1989)

> hT258 a.a. 370 aa vs.

> SwissProt Q99795 - (untitled) 319 aa
scoring matrix: pam120.mat, gap penalties: -12/-4
23.0% identity; Global alignment score: -102

10 20 30 40 50 60

inputs MISLPGPLVTNLXRFLFLGLSALAPPSRAQLQLHLPANRLQAVEEG-ESGASAWYTLHREVSSSQPWEVP 

MVGKMWPNrt^nTLCA-VRVTTOAISVETPQD^^ 

10 20 30 40 50 60 



70 80 90 100 110 120 130 

inputs FVMWFFKQKEKEDQVLSYINGVTTSKPGVSLVYSMPSRNLSLRVEGLQEKDSGPYSCSVNVQDKQGKSRG 

WIWPFSNKN YIHG-ELYKNRVSISNNAEQSDASITIDQLTMADNGTYECSVSLMSDLE G 

70 80 90 100 110 120 

140 150 160 170 180 190 200 

inputs HSIKTLELNVLVPPAPPSCRLQGVPHVGANVTLSCQSPRSKPAVQYQWDR--QLPSFQTFFAPALDVIRG

NTKSRVRLLVLVPPSKPECGIEGETIIGNNIQLTCQSKEGSPTPQYSWKRYNILNQEQPLAQPASGQ 

130 140 150 160 170 180 190 

210 220 230 240 250 260 270 

inputs SLSLTNLSSSMAGVYVCKAHNEVGTAQCNVTLEVSTGP-GAAVVAEAVVGTLVGLGLLAGLVLLYHRRGK

PVSLKNISTDTSGYYICTSSNEEGTQFCNITVAVRSPSMNVALYVGIAVGVVAALIIIGIIIYCCCCRGK 
200 210 220 230 240 250 260 

280 290 300 310 320 330 340 

inputs ALEEPANDIKEDAIAPRTLPWPKSSDTISKNGTLSSVTSARALRPPHGPPRPGALTPTPSLSSQALPSPR
■ «••:••«: ■ : : • •••• 

- - DDNTED - KEDA - RPNREAYEE P - PEQLRELSREREEE - DDYR 

270 280 290 300 



350 360 370 

inputs HAHDRWGPPSTNIPHPWWGFFLWL

QEEQR - - STGRES PDH LDQ 

310 
ALIGN calculates a global alignment of two sequences
version 2.0uPlease cite: Myers and Miller, CABIOS (1989)

> hT258 n.a. 1869 aa vs.

> GenBank U79725 - Human A33 antigen precursor mR 2793 aa
scoring matrix: pam120.mat, gap penalties: -12/-4

40.$% identity; Global alignment score: 1182 



10 20 
inputs GTCGACCC ACGCGTNCNT CCAG- 



- -CTACCCCTTTGTGAGCAGTCTAGGACTTTGTACACCTCnTAAGTAGGGAGAAGGCAGGGGAGGTGGCT 
10 20 30 40 50 60 

30 40 50 

i npu ts GTNC GG AG - CCGC CCT GGGTGTCA - GCG - GC 

GGTTTAAGGGGAACTTGAGGGAAGTAGGGAAGACTCCTCTTGGGACCTTTGGAGTAGGTGACACATGAGC 
70 80 90 100 110 120 130 

60 70 80 90 
inputs TCGGCTCCCGCGCAC- -GC TCCGGCCGT CGCGC-AGCCT CGGCA C C 



CCAGCCCCAGCTCACCTGCCAATCCAGCTGAGGAGCTCACCTGCCAATCCAGCTGAGGCTGGGCAGAGGT 
140 ISO 160 170 180 190 200 

100 110 120 
inputs. - TGCAGG TCC GTGC- -GTCCCG- - -CGGCTGGCGCC CCTG 



GGGTG AG AAGAGGGAAAATTGCAGGGACCTCCAGTTGGGCCAGGCCAGAAGCTGCTGTAGCTTTAACCAG 
210 220 230 240 250 260 270 

130 140 150 
inputs AC TCCGTCC - CGGCCAGGGA GGGC- - CATGA 

ACAGCTCAGACCTGTCTGGAGGCTGCCAGTGACAGGTTAGGTTTAGGGCAGAGAAGAAGCAAGACCATGG 
280 290 300 310 320 330 340 

160 170 180 
i npu t S TTT CC CTCCCGGGGCC - - CCTGGTGACCAAC - TTGN 

TGGGGAAGATGTGGCCTGTGTTGTGGACACTCTGTGCAGTCAGGGTGACCGTCGATGCCATCTCTGTC 
350 360 370 380 390 400 410 

190 200 210 220 
inputs TGC GGTTTTTGTTC - -CTGGGGCTGAGTG- - -CCCT-C- - -GCGCC- -CC 

AACTCCGCAGGACGTTCTTCX3GGCTTCGCAGGGAAAGAGTGTCA 
420 430 440 450 460 470 480 

230 240 250 260 

inputs -CCTC- GCGGGCC CA GCTGCAACT -GCACTTGC CCGCC 

:::: ::::: :: ::::.::.:•:.:.: : : 

ACCTCCAGTCGAGAGGGACTTATTCAATGGGATAAGCTCCTCCTCACTCATACGGAAAG 
490 500 510 S20 530 S40 550 

270 260 290 300 310 320 

inputs AACOGGTTGCAGGCGGTGG - AGGAGGG GGAAAGTGGTGCTTCAGCATGGTACACCTTGC 

..::: :: ::..: .::.:.: ..:::.. :: :::::::. it 

GGCCGTTTTCAAACAAAAACTACATCCATOGTGAG 
S60 570 580 590 600 610 620 

330 340 356 360 

inputs A CAGGGAGGTGTCTTCATC-CCA GCCATGOOAGG TOC-CCTT- -TGTGATGT 

::: s.s: it sua ::: .tunt «s« it ttt. .< ».! : 

TGAGCAGTCCGATGOCTCCATCACCATTGATCAGCTO^ 
630 640 650 660 670 660 690 

370 380 390 400 410 

inputs GGTTCT TCAAAC- -AGAAAGAAAAGGAGGATCAQGTGT TXJTCCT 

: a ttt.ti . i : . . t 1 1> . •••fiSStst lilts 
OTCTCOCTOATOTCAOACCTOTAGCXJCAACA^
7 <>0 710 720 730 740 750 760 

420 430 440 

inputs ACATCAA TGGGGTCA-CAACAAGCAAACCTG- -GAOTATC 



CCAAACCAOAATGCGOCATCOAOGGAGAGACCATAATTGGGAACAACATCCAGCTOACCTGCCAATCAAA 
770 780 790 800 810 820 830 

450 460 470 480 
inputs CTTGGTCTACTCCATGCCCTC CCGGAA CCTGTC 

GGAGGGCT - CACCAACCCCTCAGTACAGCTGGAAGAGGTACAACATCCTGAATCAGGAGCAGCCCCTGGC 
840 8S0 660 870 680 890 900 

490 500 510 520 
inputs CCTGC GGGT - GG AGGGTCTCC - AGGAGAAAGACTC- - -TGGCCC- - CTACAGCTG- 



CCAGCCAGCCTCAGGTCAGCCTGTCTCCCTGAAGAATATCTCCACAGACACATCGGGTTACTACATCTGT 
910 920 930 940 950 960 970 

530 540 550 560 570 
inputs - * CTCCGTG AATGTGCAAGACAAACAAG - - - GCAA * ATCTAGG - GGCCA - C AG CAT- - -CA 

ACCTCCAGCAATGAGGAGGGGACGCAGTTCTGCAACATCACGGTGGCCGTCAGATCTCCCTCCATGAACG 
980 990 1000 1010 1020 1030 1040 

580 590 600 610 620 
inputs AAACC TTAG AACTC AATG T AC - TGGTT CCTC CA - - - GCTCCTCCATC - - CTG 



TG G C C CTGT ATG TGGG C ATCG CGGTG GG C GTGGTTG C AGCCCTC ATT AT C ATTGGC ATC ATC AT CT ACTG 
1050 1060 1070 1080 1090 1100 1110 

630 640 650 660 670 
inputs C- - CGTCT - CCAGGGTGTG - -C CCCATGTG GGGGCAAACGTGACC CTGAGCTGCCAGT 



CTGCTGCTGCCGAGGGAAGGACGACAACACTGAAGACAAGGAGGATGCAAGGCCGAACCGGGAAGCCTAT 
1120 1130 1140 1150 1160 1170 1180 

680 690 700 710 
inputs - - CTCCAAGG - AG -TAAGCCCGCTGTCCAA TAC- - -CAGTGGG 



GAGGAGCCACCAGAGCAGCTAAGAGAACTTTCCAGAGAGAGGGAGGAGGAGGATGACTACAGGCAAGAAG 
1190 1200 1210 1220 1230 1240 1250 

720 730 740 750 760 

nputs ATCGG CA GCTTCCATCC • • - TTCCAG - ACTTTCTTTG - CA - - CCAGCATT AG ATGTCATCC 

AGCAGAGGAGCACTGGGCGTGAATCCCCGOACCACCTCGACCAGTX3ACAGQCCAGCAGCAGAGGGCTO 
1260 1270 1280 1290 1300 1310 1320 

770 780 790 800 810 820 

nputs GTGGG- TCTTTAAGCCTCA - CCAA CCTTTCGTCTTCCATGGCTGG AGTCTA - TGT 

2 • 2 « • • • t j • ^ * • ■ 2 2 • ? J I 2 2 2 2X2 •••••• « jj j 225 

GAGGAAGGGTTAGGGGTTCATTCTCCOGCTTCCTW 
1330 1340 1350 1360 1370 1380 1390 

830 840 850 860 870 

n P utB CTGCAAGGCCCA- -CAATOAGGTGGGCA CTGCCC-AATGTAA- -TGTQ AC GCTGG 

: . : : : : 22.222222.22 22 222 2.222: 2222 22- 22222 

CCCTCCATCCCAGACATTGATGGGGACATTTCTTCCCCAGTO 
.1400 . 1410 1420 1430 1440 * 1450 1460 

.680 690 900 910 920 930 

aputs -JUKmBtfl-CftCMjq^ - -OGTTO-QACT 

•.tit t«S f t.2 2 2 St*. lilt.'S. f «ftt« tltt wmt tttl IX 22 Itt. 

TAAGGGOOTCCCTXr ro C-TQATCCiroC TO ^^ 
1470 1460 1490* .1500 1510 1520 1$30 

: 

940 950 960 970 960 
inputs GGGGTTGCTGGCTGGGC TGGTCCT - -CTTGTACCACCGC CGG GGCAAG-GC-

t tu tn its . .i.s:. !t . •ss.ss.t :« 85lS *I*JljL 

CCWKmi<XK«K^TOOCCCTCACrCAA 
1S40 1550 1560 1570 1580 1590 1600 

990 1000 1010 1020 
inputs CCTOGAGG- - -AGC-CAG CCAAT- -GATATC AAGGAGG- -ATGCCAT 

CCTGGAAGCACAGCGCTGAGCATGGGGCGCTCCCACTCAGAACTCTCCAGGGAGGOT 
1610 1620 1630 1640 1650 1660 1670 

1030 1040 1050 
i npu ts TGCTCCCCGGA - - - CCCTGC - CCTGG C - CCAAGAG CTCAG 

GGGGTGGGGGCTGTCCTGCTCACCTGTGTGCCCAGCACCTGGAGGGGCACCAGGTGGAGGGTTTGCACTC 
1680 1690 1700 1710 1720 1730 1740 

1060 1070 1080 1090 
inputs -ACACAATCTCCAAGAATG - GGACCCT TT - -CCTCTGT 

CACACATCTTTCTTGAATGAATGAAAGAATAAGTGAGTATGCTTGGGCCCTGCATTGGCCTGGCCTCCAG 
1750 1760 1770 1780 1790 1800 1810 

1100 1110 1120 
inputs CACC TCCGCACGAGCC CT-CCGG CCA--CCC-C ATGGCC- -C 



CTCCCACTCCCTTTCCAACCTCACTTCCCGTAGCTGCCAGTATGTTCCAAACCCTCCTGGGAAGGCCACC 
1820 1630 1840 1850 1860 1870 1880 

1130 1140 1150 1160 1170 
i npu t s TCCC AGGCCTGGTGCATTGACCC CC ACGCCC AGTCTATC - CAGCCAGGC - - 

TCCCACTCCTGCTGCACAGGCCCTGGGGAGCTTTTGCCCACACACTTTCCATCTCTGCCTGTCAATATCG 
1890 1900 1910 1920 1930 1940 1950 

1180 1190 1200 1210 1220 1230 

inputs - - CCTGCCCTCACCAAGACATGCCCACGACAGATGGGG - -CCC- -ACCCTCAACCAATATCCCCCATCCC 

TACCTGTCC - CTCCAGGCCCATCTCAAATC AC AAGGATTTCTCTAACCCTATCCTAATTGTCCACATACG 
1960 1970 1980 1990 2000 2010 2020 

1240 1250 1260 1270 1280 
inputs TGGTGG GGTTT- - -TTTCCTTTGGCTT TGAGCCGCATGG GT- -GCTGNGC 

TGGAAACAATCCTGTTACTCTGTCCCACGTCCAATCATGGGCCACAAGGCACAGTCTTCTC 

2030 2040 2050 2060 2070 2080 2090 

1290 1300 1310 1320 1330 

inputs CTGTG ATGGNGC - - CTGC - CCA - G AGTCAAG - - CTGGCTCTC * TGG - - TATGATG ACCC C 

:•: :::: : : 2 s : s : . : i : ' 

TCTCACTGTATTAGAGCGCCAGCTCCTTGGGGCAGGGCCTGGGCCTCATGGC^ 

2100 2110 2120 2130 2140 2150 2160 

1340 1350 1360 1370 1380 

inputs AC CACTCAT TGG - - - CTAAAG - - G ATTTGGGOTCTCTCCTTCCT ATAAGGGT 

: ::: : s . : : : • * s * * V s 

CCTAGTAGCTGGCGCCCATCCTAGTGGGCACTTAAGCTTA^ 

2170 2180 2190 2200 2210 2220 2230 

1390 1400 1410 1420 1430 

inputs --CAC--CTCTAG-CAC AG A -GGCCTOAGTCATGGGAAAC^<MX^C^CTCCTGACCC TTAG 

: : : . : t 1 1 s • t s x . 1 1 . 1 1 . i . 1 1 : t ::: n * 

TTCCCTrCTCTOGTCTCCTTGAGATGATCG^^ - CCAAACCCACGT ATTCA 

2240 2250 2260 2270 2260 2290 2300 

1440 1450 1460 1470 1460 1496 1500 

inputs TACTCTOCCCCCACCTCTCTTTACr^^ 

I • S . It I S • I .tli I I tl • S • S t t I • I I I • • • • S * 

TTGAGTGAGTTAAACACX3AATTGATCTAAAGTGAACACACAC -CAOATGGTCTOA 
2310 2320 2330 k 2340 23S0 2360 2370 
1510 1520 1530 1540 1550 1560

i npu t S AG G AG AAG AGG AAGT GG ATCTGG AATTGGG AGG AGCCTCC ACCCACCCCTG ACTCCTC 

GTTCTTGTGTCCTGGTAATTCCTCTCCAGGCCAGAATAATTGGCATGTCTCCTCAACCCACATGGGGTTC 
2380 2390 .2400 2410 2420 2430 2440 

1570 1580 1590 1600 1610 1620 
i npu t s CTTATGAAGCCAGCTGCTGAAATTAGCTACTCA- - CCAAG AG TGAGGGGCA-GAGACTTC 

• • • ••»• ■ • . » • ■ • . •« • • • * 

• « a • •••*■> • ■•«• • a a • . . . . » • * a • • ••»•• « • • ••• 

CTGGTTGTTCCTGCATCCCGATACCTCAGCCCTGGCCCTGCCCAGCCCATTTGGGCTCTGGTTTTCTGGT 
2450 2460 2470 2480 2490 2500 2510 

1630 1640 1650 1660 1670 

i npu C S C AGTCACTG AGTC - - TCCCA - GGCCCCCTT GATCTGTACCCC ACCCCTATCTAACAC 

GGGGCTGTC-CTGCTGCCCTCCCACAGCCTCCTTCTGTTTGTCGAGCATTTCTTCTACTCTTGAGAGCTC 
2520 2530 2540 2550 2560 2570 2580 

1680 1690 1700 1710 1720 1730 

inputs CACCCTT- -GGCTCCCA CTCCAGCTCCCTGTATTGATATAACCTGTCAG- -GCTGGCTTGGTT 

• a a a a ■ a a • a • a • * a a m a a a a a • a • • • 

• a • •••• « • a • a a a a a • • a • a a a • • a a a a > •••• 

AGGCAGCGTTAGGGCTGCTTAGGTCTCATGGACCAGTGGCTGGTCTCACCCAACTCCAGTTTACTATTGC 
2590 2600 2610 2620 2630 2640 2650 

1740 1750 1760 1770 1780 1790 

i np u 1 5 AGGTTTTACTGGG - GCAG AGG ATAGGG AATC TCTTATTAAAACTAAC - ATG AAATATGTGTTG T 

TATCTTTTCTGGATGATCAGAAAAATAATTCCATA 

2660 2670 2680 2690 2700 2710 2720 

1800 1810 1820 1830 1840 1850 1860 

.nputs TTTCATTTGCAAATTTAAATAAAG - - - ATACATAATGTTTGTATGAGATAAGAAAAAAAAAAAAAAAGGG 

• • a a a m a a • a a a • « • a* a » - ■ a aa a a a a • ? ? 2 « „ - .221 

ATATTTTTATATATATTGTT AAATCCTTTGCTTCAT - TCCAAATGCTTTC AGTAAT AATAAAATTGTGGG 
2730 2740 2750 2760 2770 2780 2790 



nputs CGGCCGC 
TGG 
ALIGN calculates a global alignment of two sequences
version 2.0uPlease cite: Myers and Miller, CABIOS (1989)

> mT258 a.a. 394 aa vs.

> SwissProt Q99795 - (untitled) 319 aa
scoring matrix: pam120.mat, gap penalties: -12/-4

23.0% identity; Global alignment score: -149 

10 20 30 40 50 60 70 

inputs MILQAGTPETSLLRVXFLGLSTLAAFSRAQMELHVPPGLNKL^ 

: . : : : : : . . : .::.::.::"•• • 

MV -GKMWPVLW TLCAVRVTVDAISVETPQDVLRASQGKSVTLPCTYHTSTSSREGLIQWD 

10 20 30 40 50 

80 90 100 110 120 130 140 

inputs PILIWFLEQEGK£PNQVLSYINGV>ITNKPGT^ 



KJLLLTHTERWIWPFSNKNYIHGELYKNR-VSISNNAEQSDASITIDQLTMADNGTYECSVSLMSDLE-- 
60 70 80 90 100 110 120 

150 160 ;70 180 190 200 210 

inputs IGHSIKSIELKVLVPPAPPSCSLQviVPVVGT^ 

- GNTKS R VRLL VLVP PS KPECG I EGETI I GNNIQLTCQS KEGS PTPQYS WKR YNI LNQE - - QPLAQPASG 
130 140 150 160 170 180 190 

220 230 240 250 260 270 

inputs -SLKLTNLSIAMSGVYVCKAQNRVGFAKCNVTLDVMTGS-KAAVVAGAVVGTFVGLVLIAGLVLLYQRRS

QPVSLKNISTDTSGYYICTSSNEEGTQFCNITVAVRSPSMNVALYVGIAVGVVAALIIIG--IIIYCCCC
200 210 220 230 240 250 260 

280 290 300 310 320 330 340 

inputs KTLEELANDIKEDAIAPRTLPWTKGSDTISKN^^



RGKDDNTED - KEDA - RPNREAYEEPPEQ - - - LRELSR 

270 280 290 

350 360 370 380 390 

inputs PRLPRVDEPPPQAVSLTPGGVSSSALSRMGAVPVMVPAQSQAGSLV



EREE- -EDDYRQEEQRSTGRESPDHLDQ- 
300 310 
ALIGN calculates a global alignment of two sequences
version 2.0uPlease cite: Myers and Miller, CABIOS (1989)

> mT258 n.a. 1846 aa vs.

> GenBank U79725 - Human A33 antigen precursor mR 2793 aa
scoring matrix: pam120.mat, gap penalties: -12/-4

40.0% identity; Global alignment score: 908 

10 20 30 

i npu t s GTCGACCC ACGC - GTC CG- -GTGCAC- - ATT C-- GGGTTGCCGCC 

: : : : : : . : : : : : : - : : : 

- - CTACCCCTITGTGAGCAGTCTAGGACTTTGTACACCTGTTAAGTAGGGAGAAGGCAGGGGAGGTGGCT 
10 20 ' 30 40 50 60 

40 50 60 70 
inputs G - CT C ACC - CACAACACCTGT AG AC AC - CGTGTGT 



GGTTTAAGGGGAACTTGAGGGAAGTAGGGAAGACTCCTCTTGGGACCTTTGGAGTAGGTGACACATGAGC 
70 80 90 100 110 120 130 

80 90 100 110 

inputs CCAAC TCTCC CTGAGTA - CTC - -. CGGGCCA AGG - AGGGCCATGAT 

CCAGCCCCAGCTCACCTGCCAATCCAGCTGAGGAGCTCACCTGCCAATCCAGCTGAGGCTGGGCAGAGGT 
140 150 160 170 180 190 200 



120 130 140 150 160 

inputs TCTTCAG GCTGGAACCCCCGA - - -GACCAG-C - - - TTG CTGCGGG TT - TTGTTCCTG 

GGGTGAGAAGAGGGAAAATTGCAGGGACCTCCAGTTGGGCCAGGCCAGAAGCTGCTGTAGCTTTAACCAG 
210 220 230 240 250 260 270 

170 180 190 200 210 

inputs G - G ACTG AGTACCCTTGCTGCCTTCTCCCG AGCTC AG ATGG AGTT GC A CGTGCCC - - 

: : : : : : : : : 

ACAGCTCAGA- - CCTGTCTGGAGGCTGCCAGTGACAGGTTAGGTTTAGGGCAGAGAAGAAGCAAGACCAT 
280 290 300 310 320 330 340 

220 230 240 250 

inputs CC GGGC - CTCAA - - CAAATTGG AAG - CGGTAG AGGGAGAAGAAGTG 

GGTGGGGAAGATGTCGCCTGTGTTGTGGACACTCTGTCCAGTCAGGGTGACCGTCGATGCCATCTCTGTC 
350 360 370 380 390 400 410 

260 270 280 290 300 
inputs GTGCTCCCCGCCTG- -GTACA-CGA TGGCACGGGAGGAGT CGTGGTCC 

• • • • • • • • • • • • • • » « **«*««••« • • • • 

G AA - ACTCCGCAGGACGTTCTTCGGGCTTCG CAGGGAAAGAGTGTCACCCTGCCCTGCACCTACCACACT 
420 430 440 4S0 460 470 480 



310 320 330 340 350 

inputs - -CACC-CC-- - CGGG AGGTGCCCATCCT GATCTGGTTCT TGGAACAAGAAGGGAAGGAA 

2 X 2 2 2 2 • •••«• - ••• m « ■ «*•• • • a "■«•»__« • 

* * * • • • ••••••• » • a s • • » • « • mm m ... .a*. .«*••«« . a 

TCCACClXX^GTCGAaAGGGACTTATTCAATGGGATAAGCTC 

490 500 510 520 530 540 550 

360 370 380 390 400 

inputs CCAAACCAGGTGTTGTCTTA CATTAATGGAGTCATGACAAATAAACCTG- - - 

i . . . i i . : 
TCTGGCCGTTriXIAAACAAAAACTACATCXATGGTGAG 

560 570 580 590 600 610 620 

410 420 43<T 440 450 
inputs GAACAGCCCTGGTCCAC- -TCT ATCT CTTCACGG AATGTGTC - CCTGCG 

II. Ill IS . U.I II. III. 1*1 .1 II I I lit. I I 

TOCTCA0CAGTOOGATGCC1CCATCACCATTCATC 



630 640 650 660 670 680 . 690 

460 % 470 480 * 490 500 % 

inputs -C CTGGGGGCACTCCAGG AGGGAG ACTCTGGGAC - - - TTACCGCTOTTCTGTCAATGTGC - - - 

* ill. iit.ii.itiitt^. is.t..:.i i .tizs i in . nil 
TCrcTCTCGCTGATGTCAGACCTGQAGG^

700 710 720 730 740 750 760 

510 520 530 540 550 

inputs AG AATG ATG AAGGCAA - -AAGTATAGGCCACA GCATCAAAAGCATA- -GAGCT- -CAA- - 

: . . : s • • : : : : . : . : : : : : . : : : : : • • : : : : ' * ' 

CCTCCAAACCAGAATGCGGCATCGAGGGAGAGACCATAATTGGGAACAACATCCAGCTGACCTGCCAATC 
770 780 790 800 810 820 830 

560 570 580 590 600 610 620 

inputs AGTGCTGGTTCCTCCAGCTCCTCCATCCTGTAGTTT ACAGGGTGTAC - - CCT ATGTCGG G A CCAAT 

: . . : - : : : : : : : : . : . : ... : : : . . : : : : " 

AAAGG AGGGCTC ACCAACCCCTCAGT ACAGCTGG AAG - AGGTACAAC ATCCTG AATCAGG AGC AGCCCCT 
840 950 860 870 880 890 900 

630 640 650 660 
inputs GT G ACC CTGAACTGCAAGTCCCCAAGGAGTAAA ---CC - TACTGC - TC 

GGCCCAGCCAGCCTCAGGTCAGCCTGTCTCCCTGAAGAATATCTCCACAGACACATCGGGTTACTACATC 
910 920 930 940 950 960 970 

670 680 690 700 710 
inputs AGTACCA GTGGGAGAG - -GCTG GCCCCATC-CT CC- -CAGGTCT TCTTTGG 

TGTACCTCCAGCAATGAGGAGGGGACGCAGTTCTGCAACATCACGGTGGCCGTCAGATCTCCCTCCATGA 
980 990 1000 1010 1020 1030 1040 

720 730 740 750 760 
inputs AC - CAGCCTTAG ATG CTGTTCGTGGATCTTTAAAGC TCACTAACCTT TC- -CAT 

ACGTGGCCCTGTATGTGGGCATCGCGGTGGGCGTGGTTGCAGCCCTCATTATCATTGGCATCATCATCTA 
1050 1060 1070 1080 1090 1100 1110 



770 780 790 800 

i npu ts - - - TGCC ATG TCTGG AGTCTATGT - - CTGCAAGGCTCAAAAC AG AGTGG 

CTGCTGCTGCTGCCG AGGG AAGG ACGACAACACTG AAG ACAAGG AGGATGCAAGGC - CGAACCGG - GAAG 
1120 1130 1140 1150 1160 1170 1180 



810 820 830 840 
inputs GCTTTG CCA- -AGTGCAAC GTGACCTT GGACGTGATG - -ACAGG- - 

CCTATGAGGAGCCACCAGAGCAGCTAAGAGAACTTTCCAGAGAGAGGGAGGAGGAGGATGACTACAGGCA 
1190 1200 1210 1220 1230 1240 1250 

850 860 870 680 

inputs GTCCAAGGCTGCAGTGGTCG CTGG - - AG CAGTTGTGGG 

AGAAGAGCAGAGGAGCACTGGGCGTGAATCCCCGGACCACCTCGACCAGTGACAGGCCAGCAGCAGAGGG 
1260 1270 1280 1290 1300 1310 1320 

690 900 910 920 
Inputs CA - CTTTTGTTGGGTTGGTG CTGATAGCTGGGCT GGTCCTGTT- - 

• • • . .... a - - «.»«.« • *•»«•* 

• • 

CXMCGGAGGAAGGGTTAGGGGTTCATTCTCCCGCTTCCTGGCCTCCCTTCT 

1330 1340 13S0 1360 1370 1380 1390 

930 940 

.nputs GTACCAG CGCC GGAGCAAGAC 

. :* s : s : : : : 

CCTGTCCCTCCATCCCAGACATTGATGGGGACATTTCITCXX^ 

1400 1410 1420 1430 1440 1450 1460 

950 960 970 980 
nputs CTTGGAA GAGCTGG -CCAA-TGA -TATCAAG - G AAGATGCC ATT 

I 1 I I . t I . I S t t . St.. ttl i.M .1 UM...U . S 

CClXKyTAAGGG<K3TCCCroTOCTGATCCTGCTO 

1470 1480 1490 1500 1510 1S20 . « 1530 

990 1000 1010 1020 
inputs GCTCCC CGGACCTTGCCTT GGACCAA AGGCTC AGACACAA

lllLl I I I. HI III I .11111. tt si I .1.11 .1 

ACACCTCKJTGCGOOCCTGGCCCTCACTC^ 

1540 1550 1560 1570 1580 1590 1600 

1030 1040 1050 1060 1070 1OB0 
inputs TCTCCAAG AATGG - GACACTTT - CTTCGGTCACCTCAOCAC - GAGCTCT GCG- -GCCACCCA 

• • ••*•!• • 3 * « • . 3 3 • 8 • 2 ■ •■ 3 3 3 3 • • • ■ • « • * • 

GCTCCTGGAAGCACAGCGCTGAGCATGGGGCGCTCCCACTXTAG^ 

1610 1620 1630 1640 16S0 1660 1670 

.• 

1090 1100 1110 
1 npu t s AGG - CTGCTC --CT CCAAG ACCTGG CAC ATTTACT 

TGGGGGGTGGGGGCTGTCCTGCTCACCTGTC^ 

1680 1690 1700 1710 1720 1730 1740 

„ 1120 1130 1140 1150 

inputs C - CCACAC - - C - - - C AGTGT CTCTAGCCAGGCCCTGTCCT - - - CAC 

CTCCACACATCTTTCTTGAATGAATGAAAGAATAAGTGAGTATGCT 

1750 1760 1770 1780 1790 1800 1810 

1160 1170 1180 1190 1200 1210 

inputs CAAGACT- - - GCCCAGGGTAGATGAACC - CCCACCTCAGGCAGT - - GTCCCTG ACCC - - CAGGTGGGGTT 

CAGCTCCCACTCCCTTTCCAACCTCACTTCCCGTAGCTGCCAGTATGTTCCAAACCCTCCTGGGAAGGCC 
1820 1830 1840 1850 1860 1870 1880 

122 <> 1230 1240 1250 
.nputs TCTTC TTCTGCTCTGAGCC GCATGGG - TG CTGTGCCTGT - GATG 

ACCTCCCACTCCTGCTGCACAGGCCCTGGGGAGCTTTTGCCCA^ 

1890 1900 1910 1920 1930 1940 1950 

1260 1270 1280 1290 1300 1310 

nputs - -GTGCCTG- - - CACAGAGTCAGGCT -GGGTCTCTTGTGTG A- - - T AGCCCAGGCACTCATTAGCTACAT 

• •••••• 2*1 . . * l , ; f .■.!!.!..! - ■ » - • • • • • ... * • • • • 

TCGTACCTGTCCCTCCAGGCCCATCTCAAATCACAAGGATTTCTCTAACCCTATC - CTAATTGTCCACAT 
I960 1970 1980 1990 2000 2010 2020 

1320 1330 1340 1350 1360 1370 

nputs - C - TGGTATCTG ACCT - - TTCTGT AAAGGTC - TCCTT - - GTGGCACAG AGG ACTCAATCTT - -GGGAGGA 



. . #i i s s - . . *5 ' t "5; : . ; :.. 

ACGTGGAAACAATCCTGTTACTCTGTCCCACX3TCCAATCATGGGCCACAAGGCA(^ 

2030 2040 2050 2060 2070 2080 2090 

138 <> 1390 1400 1410 1420 

nputs TGCCCACA- - -TTCTAGACCTCCAG -TCCTTTG - - CT- - -CCTA- -CCTC CTT- - -CTAT- T -TGT 

:s: s * ,s : - ssss : :::: ::::: : :. :s:: ::: ::. 

TGCTCTCACTGTATTAGAGCXSCCAGCTCXrrTGGGGCAGGGCC^ 

2100 2110 2120 2130 2140 2150 2160 

1430 1440 1450 

nputs TG G AATACTGO - GCC - -TC- - AGTAAG - ACTAAA ATCTG 

•JJ :5J s *»* x? su«.i :::.s: : . : : : 

AGCCCTAGTAGCTGGCGCCCATCCTAGTGGGCACTTAAGCTTAATTGQTO 

2170 2180 2190 2200 2210 2220 2230 

1460 1470 1480 
npuCS GGTCA AAGGACAAAAGGAGGAAAT OGACC 



mi. . . . * : i : is*. 

GCCITCCCTTCrrcrGGTCTCCTTGAGATGATC^ 

2240 2250 2260 2270 2260 2290 2300 

1490 1500 ^ 1510 1520 1530 

Q P uts -TGAGGTAGG- - -GGGTTGGGAGTGAGGAGGCT-TCACTT CCTCCCTOCT7 TCT- 

' * 1 1 »!•• : • • 1 1 1 • • ♦•11 .in*. } t t till: t:: 

CATTCAGTGAGTTAAACACGAATTOATTTAAAGTGAACACACACAAOOQA 

2310 2320 2330 N 2340 2350 2360 2370 
1540 1550 1560 1570 1580

inputs CCCTG AAGCC AG ATG AATG CT GC--GGAAGATCGGCT ACCCTCCAAGGGCT 

: : : . , : : : : j : : . . • : : : : 

AGTTCTTGTGTCCTGGTAATTCCTCTCCAGGCCAGAATAATTGGCATGTCTCCTCAACCCACATGGGGTT 
2360 2390 2400 2410 2420 2430 2440 

1590 1600 1610 1620 1630 1640 

inputs C - TGG AGG AG ACTGCC AGTC AGTG ATGC - - CCCTGGCTCTG TGATCTGTAC AAC ACCC - TTATCTAA 

• ••• • »••• • « . •••*•••••• • • • • • S2»SIS«a 

CCTGGTTGTTCCTGCATCCCGATACCTCAGCCCTGGCCCTGCCCAGCCCATTTGGGCTCTGGTTTTCTGG 
2450 2460 2470 2480 2490 2500 2510 

1650 1660 1670 1680 1690 
i np u t s TG - - - CTGTCCTT - TGCCGTTCGCTCCATCTCC - - CTGT ATT AAT ATAAC 

TGGGGCTGTCCTGCTGCC-CTCCCACAGCCTCCTTCTGTTTGTCGAGCATTTCTTCTACTCTTGAGAGCT 
2520 2530 2540 2550 2560 2570 2580 

1700 1710 1720 
inputs CTGTC --CTGCT GGCT TGGCTGG GTTT- -TGTTG 

: . : : : : : : : : : : ::::::: 

CAGGCAGCGTTAGGGCTGCTTAGGTCTCATGGACCAGTGGCTGGTCTCACCCAACTGCAGTTTACTATTG 
2590 2600 2610 2620 2630 2640 2650 

1730 1740 1750 1760 1770 

inputs TAGCAGGGGG ATAGGAAAGACATTT - - TAAAATCTG ACTTG AAATTGATGTTTTTGTT 

CTATCTTTTCTGGATGATCAGAAAAATAATTCCATAAATCTAT^ 

2660 2670 2680 2690 2700 2710 2720 

1780 1790 1800 1810 1820 1830 1840 

i npu t s TTT ATTTTG CAAATTTC AAT AAAG A TACATCG CATTTGC ATGG AAAAAAAAAAAAAAAGGGCG 

• • •••• • a : : : . : 

TATATTTTTATATATATTGTTAAATCCTTTGCTTCATTCCAAATGCTTTCAGTAATA^ 

2730 • 2740 2750 2760 2770 2780 2790 



inputs GCCGC 
T--GG 
ALIGN calculates a global alignment of two sequences
version 2.0uPlease cite: Myers and Miller, CABIOS (1989)

> hT258 n.a. 1869 aa vs.

> pecaxn n.a. 2557 aa
scoring matrix: pam120.mat, gap penalties: -12/-4

40.5% identity; Global alignment score: 1546 

10 20 . 

inputs G TC GACC CAC GCGTNCNTC-CAGCGTN- 



GAATTCCGGGAGAAGTGACCAGAGCAATTTCTGCTTTTCACAGGGCGGGTTTCTCAACGGTGACTTGTGG 
10 20 30 40 50 60 70 

30 40 50 60 70 80 
inputs -CGGAGCCGCC — CTGGG — TGTCAGCGGCTCGGCTCCCGCGCACGCT CCGGC CGTCGC 



GCAGTGCCTTCTGCTGAGCGAGTCAT-GGCCCGAAGGCAGAACTAACTGTGCCTGCAGTCTTCACTCTCA 
80 90 100 110 120 130 

90 100 110 120 130 

inputs GCAGCCTCGGCA — CCTGCAGGTCCG TGCGTCCCG CGGCTGGCGCCCCTGACTCCGTC 

• ••••• • • • • * • • • m • • •••••• • ••• ■•■ • • 

GGATGCAGCCGAGGTGGGCCCAAGGGGCCACGATGTGGCTTGGAGTCCTGCTGACCCTTCTG-CTCTGTT 
140 150 160 170 180 190 200 

140 150 160 170 180 
inputs CCGG CCAGGG AGGGCC ATG ATTTCCCT — CCCGG — GGCC CCTGGTGA- CC AAC T 



• • • • 



CAAGCCTTG-AGGGTCAAGAAAACTCTTTCACAATCAACAGTGTTGACATGAAGAGCCTGCCGGACTGGA 
210 220 230 240 250 260 270 

190 200 210 220 230 
inputs TGNTGCGGTTT TTGTTCCTGGGGCTG-AGTGC — C-C TC-GCGCCCCC-CTCGCG GGCC 



••*••••••« w « • ••••• . * * •« • • a • • •• • •••• 

CGGTGCAAAATGGGAAGAACCTGACCCTGCAGTGCTTCGCGGATGTCAGCACCACCTCTCACGTCAAGCC 
280 290 300 310 320 330 340 

240 250 260 270 280 

inpu t S — CAGCTGCAACTGC ACTTGC C CGCCAACCGGTTGCAGGCGGTG 

::::• ::• ::: :: ::: : • ::. : . •:»:•: 

TCAGCACCAGATGCTGTTCTATAAGGATGACGTGCTGTTTTACAACATCTCCTCCATGAAGAGCACAGAG 
350 360 370 380 390 400 410 

290 300 310 320 330 340 

input 8 G AGG AGGGGGA AAGT- -GGTGCTTCAGCA-TGGTACACCT TGCACAGGGAGGTGTCTTCATC 

• •• :::: ::. ::. .:: ::: : m it lit •«n»: 

AGTTATTTTATTCCTGAAGTCCGGATCTATGACTC^GGGACATATAAATGTACTC 
420 430 440 450 460 470 480 

350 360 370 380 390 

inputs CCAG CCA-TGGGAGGTGCC — CTTT — GTGATGTGGTTCTTCAAACAGAAAGAAAAGGAGGATC 

::: :: :: : : • .::: 
AAAGAGAAAACCACTGCAGAGTACCAGCTGTTGGTCK3
«0 500 510 520 530 540 550 

400 410 420 430 440 450 

inputs AGGTGTTGTCCTACATCAATGGGGTCA CAACAAG-CAAACCTGGAGTAT CCTTGG-T 

. . * 

AGAAAGAGGCCATCCAAGGTGGGATCGTGAGGGTCAACTGTTCTGTCCCAGAGGAAAAGGCCCCAATACA 

560 570 580 590 . 600 610 620 

.- 

460 470 480 490 500 

inputs CTACTC CATGCCCTCCCGGAACC — TGTCCCTGC-GGGTGGAGGG TCTC 

: * • • • 2 : . : : : : : : : : • : : : : 

CTTCACAATTGAAAAACTTGAACTAAATGAAAAAATGGTCAAGCTGAAAAGAGAGAAGAATTCTCGAGAC 
630 640 650 660 670 680 690 

510 520 530 540 550 

inputs C AGG AG AAAG — ACTCTGG CCCCTAC -AG CTGCTCCGTGAATGTGC AAGACAAACAA 

s:: - ! : :::: . . : 

CAGAATTTTGTGATACTGGAATTCCCCGTTGAGGAACAGGACCGCGTTTTATCCTTCCGATGTCAAGCTA 
700 710 720 730 740 750 760 

560 570 580 590 
inputs GG — CAAATCTAGGGGCCA CAG-CATCAAAA CCTTA GAACTCAATG 



GGATCATTTCTGGGATCCATATGCAGACCTCAGAATCTACCAAGAGTGAACTGGTCACCGTGACGGAATC 
770 780 790 800 810 820 830 

600 610 620 630 

inputs -TACT GGTTCCTC — CAGCTCCTCC ATCCTG C-CGTCTCCA — GGGTG 

:::: 2:.:: ::: :: ; s . ::::: 

CTTCTCTACACCCAAGTTCCACATCAGCCCCACCGGAATGATCATGGAAGGAGCTCAGCTCCACATTAAG 
840 850 860 870 880 890 900 

640 650 660 670 680 690 

inputs TGCCCCATG-TGGGGGCAAACGTGACCCTG-AGCTGCCAG TC TC CAAGGAGTAAG 

s:: *m ..: :: :: : :: ♦: ::::: 2...: 

TGCACCATTCAAGTGACTCACCTGGCCCAGGAGTTTCCAGAAATCATAATTCAGAAGGACAAGGCGATTG 
910 920 930 940 950 960 970 

700 710 720 
inputs CCC GCTGT C CAATACCAGTG -GG ATC GGCAGCTT 

2 2 2 SSSSS " • • • • • •••«•• 

• • * * * • • •••••• • • • • * ••••*«•• 

TGGCCC^CAACAGACATGGCAACAAGGCTGTGTACTCAGTCATGGCCAT^ 
980 990 1000 1010 1020 1030 1040 

730 740 750 760 770 

Lnputs C-CATCCT TCCAGAC TTTCTTTG — CACCAGCATTAGATGTCATCCGTG — GGTCTTTA 

8 : * 5 *• 1 1: 22:212 s. :..t:.i. 

CAOGTGCAAAGTGGAGTCCAGCOGCATATCCAAGGTCAGCAGCATC^TGGTCAACAT 

1050 1060 1070 1080 1090 1100 ' 1110 

780 790 800 810 820 830 
inputs ACCCTCACC-AACCTTTCGTCTTCCAT GOCTGGA GTCTATGTCTG — CAAGGCC — CACA

• ::. .tz mt tt .zntit.: . tut: tt •t.t.tt: t .: tt t.t 

TTCCAAGCCCGAACTGGAATCTTCCTTCACACATCTGGACCAAGGTGAAAGACTGAACCTGTCCTC 
1120 * 1130 1140 1150 1160 1170 1180 

840 850 860 870 

inputs ATG — AGGTG GGC — ACTGOCCA ATGTAA TGTGACGCTGG AAGTGA 

* s : : : . : ^ . : : ::::::: : . : : : : : : • . 

ATCCCAGGAGCACCTCCAGCCAACTTCACCATCCAGAAGGAAGATACGATTGTGTCACAGACTCAAGATT 
1190 1200 1210 1220 1230 1240 1250 

880 890 900 910 920 930 

inputs GCAC AGGGCCT-GGAG-CTG-CAGTGGTTGCTGAAGCTGT — TGTGGGTACC — CTGGTTGGACTG 

: : : : . : : : : . . : : : : :::::: . : : . : : : : : . . . 

TCACCAAGATAGCCTCAAAGTCGGACAGTGGGACGTATATCTGCACTGCAGGTATTGACAAAGTGGTCAA 
1260 1270 1280 1290 1300 1310 1320 

940 950 960 970 980 
inputs GGGTTGCTG-GCTGGGCTGGTCCTCTTGTA C-CACC — GCCGGGG CAAG GCCCTG 



GAAAAGCAACACAGTCCAGATAGTCGTATGTGAAATGCTCTCCCAGCCCAGGATTTCTTATGATGCCCAG 
1330 1340 1350 1360 1370 , 1380 1390 

990 1000 1010 1020 1030 1040 
inputs GAGGAGCCAGCCAATGATATCAAG — G AGGATG- CCATTGCTCCCCGG ACC CT — GCC C 



• • • • 



TTTGAG — GTCATAAAAGGACAGACCATCGAAGTCCGTTGCGAATCGATCAGTGGAACTTTGCCTATTTC 
1400 1410 1420 1430 1440 1450 1460 

1050 1060 1070 1080 1090 
inputs TGGCC — CAA — GAGCTCAGACACAATCTCCAAGAATGGGACCCTTTCCTCTGTCA — CCTCCG C 



• •••••• 



TTACCAACTTTTAAAAACAAGTAAAGTTTTGGAGAATAGTACCAAGAACTCAAATGATCCTGCGGTATTC 
1470 1480 1490 1500 1510 1520 1530 

1100 1110 1120 1130 

inputs A CGAGCCCTCCG — GCCACC CCA TGGC CC — TCCCAGGCCT 

: :•: : : : * s ::: : :: ss ::::: 

AAAGACAACCCCACTGAAGACGTCGAATACCAGTGTGTTGCAGATAATTGCCATTCCCATGCCAAAATGT 
1540 1550 1560 1570 1580 1590 1600 

1140 1150 1160 1170 

inputs -GGTGCATTG ACCCCC- ACGCCCAG TCTATCCAGCCAGG 

**: : :::: : : tttt s:ttt::.s 

TAAGTGAGGTTCTGAGGGTGAAGGTGATAGCCCCGGTGGATGAGGTCC^ 

1610 1620 1630 1640 1650 1660 1670 

1180 

Lnputs c C CTGC CC--TCACCAAG 

: : ttt t :t itui.s 

GGTGGTGGAGTCTGGAGAGGACATTGTGCrrcCAATGTGCT 

1680 1690 1700 1710 1720 1730 1740 
1190 1200 1210 1220 1230 1240

inputs ACATGC — CCACGACAGATGGGGCCCACCCT-CAACC^-TATCCCC CATCCCTGG TGGGGT 

: : . it :.: ::::: ::: x..ii : : : 

AAGTTTTACAGAGAAAAAGAGGGCAAACCCTTCTATCAAATGACCTCAAATGCCACCCAGGCATTTTGGA 
1750 1760 1770 1780 1790 1800 1810 

1250 1260 1270 1280 1290 1300 
inputs TTTTTCCTTTGGCTT TG AG CCG CATGGGTG CTG NGCCTGTGATGGNGCCTGCC 



CCAAGCAGAAGGCTAGCAAGGAACAGGAGGGAGAGTATTACTGCACAGCCTTCAACAGAGCCAACCACGC 
1820 1830 1840 1850 ' 1860 1870 1880 

1310 1320 1330 1340 1350 

inputs CAGAGTC AAGCTGGCTCTCTG-GT-ATGATGACCCCACCACTCATTGG-CTAAAG 

:; :: : • . 

CTCCAGTGTCCCCAGAAGCAAAATACTGACAGTCAGAGTCATTCTTGCCCCATGGAAGAAAGGACTTATT 
1890 1900 1910 1920 1930 1940 1950 

1360 1370 1380 1390 
inputs G ATTTGGG -GTC-TC TCCTTCCTATAAGGGTCA CCTCTAGCA CAGAGG 

GCAGTGGTTATCATCGGAGTGATCATTGCTCTCTTGATCATTGCGGCCAAATGTTATTTTCTGAGGAAAG 
1960 1970 1980 1990 2000 2010 2020 

1400 1410 1420 1430 1440 1450 

inputs CCTGAGTCATG-GGAA-- — AGAGTCACACTCCTGACCCTTAGTAC TCTG — CCCCCACCTC 

: . ::::: :::: : - • * 

CCAAGGCCAAGCAGATGCCAGTGGAAATGTCCAGGCCAGCAGTACCACTTCTGAACTCCAACAACGAGAA 
2030 2040 2050 2060 2070 2080 2090 

1460 1470 1480 1490 1500 1510 

inputs TCTTTAC — TGTGGGAAAACCATC — TCAGTAAGACCTAAGTGTCCAGGAGACAGAAGGAGAA-GA 

::. . . : : . : ::: : . : . : 

AATGTCAGATCCCAATATGGAAGCTAACAGTCATTACGGTCACAATGAC—GATGTCAGAAACCATGCAA 
2100 2110 2120 2130 2140 2150 2160 

1520 1530 1540 1550 1560 1570 

inputs GG AAGTGGATCTGGAATTGGGAGGAGCCTCCACCCACC-CCTGAC — TCCTCC TTATGAAGCCAGC 

s :::: . s«: : . : : :•: : 
TGAAACCAATAAATGATAATAAAGAGCCTCTGAACTCAGACGTGCAGTACACGGAAGTTCAAGTGTCCTC 
2170 2180 2190 2200 2210 2220 2230 

1580 1590 1600 1610 1620 1630 1640 

inputs TGCTGAAATTAGCTACTCACCAAGAGTGAGGGGCAGAGACTTCCAGTCACTGAGTCTCCCAG GCC— 

•txx::.. : .: t s : :t: .t • is 

AGCTGAGTCTCACAAAGATCTAGGAAAGAAGGACACAGA^ACAGTGTACAGTG 

2240 2250 2260 2270 2280 2290 2300 

1650 1660 1670 1680 1690 1700 

inputs CCCTTGATCTGTACCCCAC CCCTA — TCTAACACCACCCTTG — GCTCCCACTCCAGCTC — CCT 

. : x ttz .s.:.t. :::::: :nti ::. 

CCCTGATGCOGTGGAAAGCAGATACTCTAGAACGGAAGGCTCCCTTGATGGAAC^ 
2310 2320 2330 2340 2350 2360 2370

1710 1720 1730 1740 1750 1760 
inputs GTATTGATATAACCTGTCAGG-CTGGCTTGGTTAG-GTTTTACTGGGG CAGAGGATAGGGA 



• • • • • • < 



GA — TGCACATCCCTGGAAGGACATCCATGTTCCGAGAAGAACAGATAATCCCTGTATTTCAAGACCTCT 
2380 2390 2400 2410 2420 2430 

1770 1780 1790 1800 1810 1820 

inputs -ATCTCTTATTAAAA CTAACATGAAATATGTGTTGTTTTCATTT — GCAAATTTAAATAAAGATACA 

!•!!!!!!.!. •• • • • • •••• • • • ■ ■ ■ 

* ••••••••#••• • • • •••• ♦ • ••••• •••• 

GTGCACTTATTTATGAACCTGCCCTGCTCCCACAGAACACAGCAATTCCTCAGGCTAAGCTGCCGGTTCT 
2440 2450 2460 2470 2480 2490 2500 



1830 1840 1850 1860 

inputs TAAT GTTTGTATGAGATAAG AAAAAAAAAAAAAAAGGGCGGCCGC-



TAAATCCATCCTGCTAAGTTAATGTTGGGTAGAAAGAGATACAGAGGGG 
2510 2520 2530 2540 2550 
TANGO 281

Input file AthPb81dl0.seq; Output File AthPb81dl0.pat 
Sequence length 1812 

M R L 3

GTCGACCCACGCGTCCGGCGGAGGTTGTGGCTGCACCGTGGTCCTGGGCT ATG CGT CTG 73

F V R P S V R P A M A A P A P S P W T L 23

TTT GTC CGT CCG TCC GTC CGT CCC GCC ATG GCT GCG CCG GCG CCC TCT CCG TGG ACC CTT 133

S L L L L L L L P S P G A H G E L C R P 43

TCG CTG CTG CTG TTG TTG CTA CTG CCG TCT CCG GGT GCC CAT GGC GAG CTG TGC AGG CCC 193

F G E D N S I P E S C P D F C C G S C S 63

TTC GGT GAA GAC AAT TCG ATC CCA GAG TCC TGT CCT GAC TTC TGT TGT GGC TCC TGT TCC 253

S Q Y C C S D V L K K I Q W N E E M C P 83

AGC CAA TAC TGC TGC TCT GAC GTG CTG AAG AAA ATC CAG TGG AAT GAG GAA ATG TGC CCT 313

E P E S S R F S A H P E T P E Q L G S A 103

GAG CCA GAG TCC AGC AGA TTT TCC GCC CAC CCG GAG ACA CCA GAA CAG CTG GGT TCA GCG 373

L K Y Q S S L D S D N M P G F G A T V A 123

CTG AAG TAT CAG TCC AGT CTT GAC AGT GAC AAC ATG CCA GGG TTC GGA GCG ACC GTG GCC 433

I G L T V F V V F I A T I I V C F T C S 143

ATC GGC CTG ACC GTC TTC GTG GTG TTT ATC GCT ACC ATC ATT GTG TGC TTT ACC TGC TCC 493

C C C L Y K M C C R P R P V V S N T T T 163

TGC TGC TGT CTA TAT AAG ATG TGC TGC CGC CCA CGA CCT GTC GTG TCC AAC ACC ACA ACT 553

T T V V H T A Y P Q P Q P V A P S Y P G 183

ACT ACC GTG GTT CAC ACC GCT TAC CCT CAG CCT CAA CCT GTG GCC CCC AGC TAT CCT GGA 613

P T Y Q G Y H P M P P Q P G M P A A P Y 203

CCA ACA TAC CAG GGC TAC CAT CCC ATG CCC CCC CAG CCA GGA ATG CCA GCA GCA CCC TAC 673

P T Q Y P P P Y L A Q P T G P P A Y H E 223

CCA ACG CAG TAC CCT CCA CCC TAC CTG GCC CAG CCC ACA GGG CCA CCA GCC TAT CAT GAG 733

T L A G A S Q P P Y N P A Y M D P P K A 243

ACG TTG GCT GGA GCC AGC CAG CCT CCA TAC AAC CCG GCC TAC ATG GAT CCC CCA AAG GCA 793

V P * 246
GTT CCC TGA 802

GCCTGCCCCCAGCCTCTTTGGCTAACATTTGATTATC 881 

TGTGGTGCGTGTGCCTTGTCTAGA 960 

GGGACCAATCTGTTTTCTTCCTCACTTGAAATTG 1039 

TATTTCCCGCTTCACCXCAAG 1118 

GCTTTGGGGAGTAGCCAGCTAGCTGCTGCTATGGGTTTA 1197 

GGGCCTCAGTGGCAGTGGGTCAGTGACTGATGTCAGAG 1276 

GTCTGGACGGTCCCACTGTCT1TCCTGGGA 1355 

GGCCAGGGGCCTCTGTCTACTACACACTCTGGT^ 1434 

TTTTTGTATCCAGATGTGtGAT^ 1513 

FIG.28A 
TCCGTTTCCCTTG ACACCAGCTTCATAGAATACCTG ACTCCTGTACTACAGTCC AGTTTGTTCC AGTAGCAGGG ACACC 1592
AGGGCCAGGGGTTATCTGGACCAAGGGTGGGGGTGGAGAGCCTGGATGGTAGC 1671 
ATTCCCTGTTGGTTCCTGTTTCACTGGCTGTTTTAGTTTTC 1750 
TCGTTT AT AAT AAATG AA T ATTTG G AAAAAAAAAAAAAAAAAAAAAAAAAA 1812 



FIG. 28B 
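The pairing of codon rows and single-letter amino acid rows in FIG. 28A can be checked mechanically. The following is a minimal sketch, not part of the original disclosure, that assumes the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code, enumerated with bases in the order T, C, A, G
# (first base slowest, third base fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon; whitespace is ignored.

    Codons containing characters outside T, C, A, G are reported as 'X'.
    """
    seq = "".join(dna.upper().split())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


# First ten and last three codons of the open reading frame shown in FIG. 28A.
first_codons = "ATG CGT CTG TTT GTC CGT CCG TCC GTC CGT"
last_codons = "GTT CCC TGA"
print(translate(first_codons))  # MRLFVRPSVR
print(translate(last_codons))   # VP*
```

Applied to each codon row of the figure, such a check flags positions where the scanned amino acid row and the codon row disagree.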






WO 00/78808 



PCT/US00/16883 




[Plot for hT281; horizontal axis: amino acid position, tick marks at 1, 41, 81, 121, 161, 201 and 241]



>hT281 

MRLFVRPSVRPAMAAPAPSPWTLSLLLLLLLPSPGAHGELCRPFGEDNSIPESCPDFCCG
SCSSQYCCSDVLKKIQWNEEMCPEPESSRFSAHPETPEQLGSALKYQSSLDSDNMPGFGA
TVAIGLTVFVVFIATIIVCFTCSCCCLYKMCCRPRPVVSNTTTTTVVHTAYPQPQPVAPS
YPGPTYQGYHPMPPQPGMPAAPYPTQYPPPYLAQPTGPPAYHETLAGASQPPYNPAYMDP
PKAVP



FIG. 29 
Alignments of top-scoring domains:

PSBH: domain 1 of 1, from 97 to 146: score 5.5, E = 8.5 

*->ktalgelLkPlnseyGKvaPgWGttplmgvfmalfavFLliileiYn
♦ lg+ Lk * s + Pg+G t+ +g +++f+vF+ i+ + 

hT281 97 PEQLGSALKYQSSLDSDNMPGFGATVAIG--LTVFVVFIATIIVCFT 141



ssvll<-* 

' s 

hT281 142 CSCCC . 146 



FIG. 30 
Input file T281Atmea49d3; Output File T281Atmea49d3.pat
Sequence length 1858 

jTCGACCCACGCGTCCGCGCGGAGGTTGCGGC^ 79 

MAAPAPSLWTLLLLLLL 17 

rTGTCCCGCC ATG GCT GCG CCG GCG CCC TCT CTG TGG ACC CTA TTG CTG CTG CTG TTG CTG 140 

lppppgahge-'lcrpfgedns 37 

ttg ccg ccg cct ccg ggt gcc cat ggt gag ctg tgc agg ccc ttt ggt gaa gac aat tcg 200 

ipvfcpdfccgscsnqyccs 57 

vTC CCA GTG TTC TGT CCT GAT TTC TGT TGT GGT TCC TGT TCC AAC CAA TAC TGC TGC TCG 260 

DVLRKIQWNEEMCPEPESSR 77 

AC GTG CTG AGG AAA ATC CAG TGG AAT GAG GAA ATG TGT CCT GAG CCA GAG TCC AGC AGA 320 

FSTPAEETPEHLGSALKFRS 97 

TT TCC ACC CCC GCG GAG GAG ACA CCC GAA CAT CTG GGT TCA GCG CTG AAA TTT CGA TCC 380 

SFDSDPMSGFGATVAIGVTI 117 

3T TTT GAC AGT GAC CCT ATG TCA GGG TTC GGA GCG ACC GTC GCC ATT GGC GTG ACC ATC 44 0 

?VVFIATI IICFTCSCCCLY 137 

TT GTG GTG TTT ATT GCC ACT ATC ATC ATC TGC TTC ACC TGC TCC TGC TGC TGT CTG TAT 500 

<MCCPQRPVVTNTTTTTVVH 157 

VG ATG TGC TGC CCC CAA CGC CCT GTC GTG ACC AAC ACC ACA ACT ACT ACC GTG GTT CAT 560 

^PYPQPQ.PQPVAPSYPGPTY 177 

2C CCT TAC CCT CAG CCT CAA CCT CAA CCT GTG GCC CCC AGC TAT CCT GGA CCA ACA TAC 620 

1GYHPMPPPARNASSTLPNA 197 

k G GGC TAC CAT CCC ATG CCC CCC CCA GCC AGG AAT GCC AGC AGC ACC CTA CCC AAC GCA 680 

PTTLPGPAHRAATLP* 214 

A CCC ACC ACC CTA CCT GGC CCA GCC CAC AGG GCC GCC ACC CTA CCA TGA 731 

'CCTTGGCTGGAGCCAGCCAGCCTCCATACAACCCGACCTACATG 810 

.GCCTCTTTGGCTGCCATTTATGTCGTGTGTGAGTGAGTGATACG 889 

TlXmrTAGACATGTGGCTTCCTCTGCTGTTGACCAGGTAGGC^ 968 

TCTTCCTCACTTCAAATTGTACTTTCTGAAATTT 1047 

OCCAAGGTCACX^GCCATGGCCTGTCATACra 1126 

2AGCTAGCTGCTGCTAGGCCTTTATTCCCAGGGTTTGGCT 1205 

IX3ACAAGGGGACTCAGTGGCAGGGGGTCACACCAGGCAGAA 1284 



FIG. 31A 
ACTGTCCTTCCCX3GGGCTGTATAG AGGGCCACATGTGTTCACTATTCAGGCTCCACTGGGGGAATTTTCCTACCTTTG 1363

TGGCTTGGCTCCTGCTCCCAGGCCAGGGACCTCGGTCTC 1442 

2TGTTAGCCAAACATTTTGCCTGTTTTCTGTCT 1 5 2 x 

\GK5ACAGACAACCTGACCTCCGACTGTCAGTTTCCCTTGACACCATCTTCATAGAAATA 1600 

rCAGTTTGTCCCAGTAGCAGGGACACCAAGGCCAATGGGTTATCTGGACCAAAGGTGGGGTGGAGGGCCTAGATGGTA 1679 

rTCCGGCCCAGATGTGAATACCTCCATATTCCCTGTTGGTTCCTGTTTCACTGGCTGTTTTAGCTTTGTGTTG ATTGG 1758 

JTTTCTGAGCATTCAGACTCCGCACCCTCATTTCTAATAAA 1837 

\AAAAAAAAAGGGCGGCCGC 1858 



FIG. 31 B 
[Plot; horizontal axis: amino acid position, tick marks at 1, 41, 81, 121, 161 and 201]

SIpSLVmJXLLLLLPPPPGAHGE^ 

^^EEMCPEPESSRFSTPAEETPEHIXSSALKFRSSFDSDPMSGFGAWAKVTIFW 
HPMPPPARNASSTLPNAVPTTLPGPAHRAATLP 



FIG. 32 
ALIGN calculates a global alignment of two sequences
version 2.0u Please cite: Myers and Miller, CABIOS (1989)

> hT281 a. a. 245 aa vs.

> mT281 a. a. 213 aa
scoring matrix: pam120.mat, gap penalties: -12/-4

66.5% identity; Global alignment score: 739 

10 20 30 40 50 60 70 

inputs' MRLFVRPSVRPAMAAPAPSPWTLSLLLLLLLPSPGAHGELCRPFGEDNSIPESCPDFC^ 

. ~ ■ _ «••••••••• 

— -^PAPSLWLLLLLiLLPPPPGAHGELCRPFGEDNSIPVFCP 

10 - 20 30 40 50 

80 90 100 110 120 130 

inputs VLKKIQWNEEMCPEPESSRFSAHPE-TPEQLGSALKYQSSLDSDNMPGFGATVAIGLTVFVVFIATIIVC 

VLRiaQVWE^^ 

60 70 80 90 100 110 120 

140 150 160 170 180 190 200 

inputs FTCSCCCLYKMCCRPRPWSNTTTTTVVHTAYPQPQP--VAPSYPGPTYQGYHPMPPQPGMPAAPYPTQY 

•,«...«_____. . . _ _ _ _________ •••••• • » _ 

FTCSCCCLYKMCCPQRPVVTNTTTTTVVHAPYPQPQPQPVAPSYPGPTYQGYHPMPP- PARN 

130 140 150 160 170 180 

210 220 . 230 240 

inputs P P P Y LAQ PTG P P AYHETLAG ASQ P P YN P AYMD P P KAV P 

AS STL - - PNAVPT TLPGPAHRA AT LP 

* 190 200 210 



FIG. 33 
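A global alignment of the kind reported in FIG. 33 can be illustrated with a small dynamic-programming sketch. This is not the ALIGN program: it assumes a simple match/mismatch score and a linear gap penalty in place of the PAM120 matrix and the -12/-4 gap penalties named in the figure, so it will not reproduce the reported score of 739 or the 66.5% identity. The function names and default scores are hypothetical, and the identity denominator (alignment length including gaps) is likewise an assumption.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment with a linear gap penalty.

    Returns (score, aligned_a, aligned_b), with '-' marking gaps.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    top, bottom = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            top.append(a[i - 1]); bottom.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            top.append(a[i - 1]); bottom.append("-"); i -= 1
        else:
            top.append("-"); bottom.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(top)), "".join(reversed(bottom))


def percent_identity(top, bottom):
    """Identical aligned positions as a percentage of the alignment length."""
    same = sum(1 for x, y in zip(top, bottom) if x == y and x != "-")
    return 100.0 * same / len(top)


s, top, bottom = global_align("ACGT", "AGT")
print(s, top, bottom, percent_identity(top, bottom))
```

For protein sequences such as the two aligned in FIG. 33, the match/mismatch term would be replaced by a substitution-matrix lookup and the gap term by separate gap-opening and gap-extension penalties.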
SEQUENCE LISTING



<110> Millennium Pharmaceuticals, Inc. 

<120> SECRETED PROTEINS AND USES THEREOF 

<130> 7853-210-228 

<140> 
<141> 

<150> 09/336,536 
<151> 1999-06-18 

<160> 198 

<170> PatentIn Ver. 2.0

<210> 1 

<211> 1338 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60 
agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 120
gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 
cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240 
actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300
gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 360
gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420 
gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 480
tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540 
tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 600 
gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 
gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 720 
ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780 
ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 840 
gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 
tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 
gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020
gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 
gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140
ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 
ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 
gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320
aaaaaaaagg gcggccgc 1338 

<210> 2 

<211> 728 

<212> DNA 

<213> Homo sapiens 

<400> 2 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 
gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120
catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180 
gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240 
ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300 
ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360 
cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 
gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480 
gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540 
ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 
ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 
tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 
tctttgct 728 

<210> 3 
<211> 243 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser
1 5 10 15

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly His Pro
20 25 30

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly
35 40 45

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu
50 55 60

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro
65 70 75 80

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala Gly
85 90 95

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu
100 105 110

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val
115 120 125

Leu Val Asn Glu Gln Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr
130 135 140

Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr
145 150 155 160

Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Glu Ser Ile Ala
165 170 175

Ser Phe Phe Gln Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser
180 185 190

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln
195 200 205

Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp
210 215 220

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro
225 230 235 240

Val Phe Ala



<210> 4 

<211> 228 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Ser Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly His
1 5 10 15

Pro Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro
20 25 30

Gly Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly
35 40 45

Glu Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp
50 55 60

Pro Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala
65 70 75 80

Gly Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser
85 90 95

Glu Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg
100 105 110

Val Leu Val Asn Glu Gln Gly His Tyr Asp Ala Val Thr Gly Lys Phe
115 120 125

Thr Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val
130 135 140

Tyr Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Glu Ser Ile
145 150 155 160

Ala Ser Phe Phe Gln Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu
165 170 175

Ser Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val
180 185 190

Gln Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr
195 200 205

Asp Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser
210 215 220

Pro Val Phe Ala
225



<210> 5 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly 
1 5 10 15



<210> 6 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly Arg Asp Gly
1 5 10 15

Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu Lys Gly Glu
20 25 30

Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro Gly Pro Arg
35 40 45

Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala
50 55 60



<210> 7 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Ala Phe Ser Ala Lys Arg Ser Glu Ser Arg Val Pro Pro Pro Ser Asp
1 5 10 15

Ala Pro Leu Pro Phe Asp Arg Val Leu Val Asn Glu Gln Gly His Tyr
20 25 30

Asp Ala Val Thr Gly Lys Phe Thr Cys Gln Val Pro Gly Val Tyr Tyr
35 40 45

Phe Ala Val His Ala Thr Val Tyr Arg Ala Ser Leu Gln Phe Asp Leu
50 55 60

Val Lys Asn Gly Glu Ser Ile Ala Ser Phe Phe Gln Phe Phe Gly Gly
65 70 75 80

Trp Pro Lys Pro Ala Ser Leu Ser Gly Gly Ala Met Val Arg Leu Glu
85 90 95

Pro Glu Asp Gln Val Trp Val Gln Val Gly Val Gly Asp Tyr Ile Gly
100 105 110

Ile Tyr Ala Ser Ile Lys Thr Asp Ser Thr Phe Ser Gly Phe Leu Val
115 120 125

<210> 8 
<211> 1263 
<212> DNA 

<213> Mus musculus 
<400> 8 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag agccgggtac 480 

ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260
cgc 1263 

<210> 9 
<211> 729 
<212> DNA 

<213> Mus musculus 
<400> 9 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240 

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 420

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 540

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 
gtcttcgct 729 

<210> 10 
<211> 243 
<212> PRT 

<213> Mus musculus 
<400> 10

Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly Ser
1 5 10 15

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly Gln Pro
20 25 30

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly
35 40 45

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu
50 55 60

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu Pro
65 70 75 80

Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala Ile Gly Pro Ala Gly
85 90 95

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu
100 105 110

Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg Val
115 120 125

Leu Leu Asn Glu Gln Gly His Tyr Asp Pro Thr Thr Gly Lys Phe Thr
130 135 140

Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr
145 150 155 160

Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Gln Ser Ile Ala
165 170 175

Ser Phe Phe Gln Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser
180 185 190

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln
195 200 205

Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp
210 215 220

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro
225 230 235 240

Val Phe Ala



<210> 11 
<211> 228 
<212> PRT 

<213> Mus musculus 
<400> 11 

Ser Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly Gln
1 5 10 15

Pro Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro
20 25 30

Gly Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly
35 40 45

Glu Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu
50 55 60

Pro Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala Ile Gly Pro Ala
65 70 75 80

Gly Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser
85 90 95

Glu Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg
100 105 110

Val Leu Leu Asn Glu Gln Gly His Tyr Asp Pro Thr Thr Gly Lys Phe
115 120 125

Thr Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val
130 135 140

Tyr Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Gln Ser Ile
145 150 155 160

Ala Ser Phe Phe Gln Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu
165 170 175

Ser Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val
180 185 190

Gln Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr
195 200 205

Asp Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser
210 215 220

Pro Val Phe Ala
225



<210> 12 

<211> 15 

<212> PRT 

<213> Mus musculus 

<400> 12 

Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly 
1 5 10 15



<210> 13 

<211> 60 

<212> PRT 

<213> Mus musculus 
<400> 13

Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly Arg Asp Gly
1 5 10 15

Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu Lys Gly Glu
20 25 30

Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu Pro Gly Pro Arg
35 40 45

Gly Glu Ala Gly Pro Met Gly Ala Ile Gly Pro Ala
50 55 60



<210> 14 
<211> 128 
<212> PRT 

<213> Mus musculus 
<400> 14 

Ala Phe Ser Ala Lys Arg Ser Glu Ser Arg Val Pro Pro Pro Ala Asp
1 5 10 15

Thr Pro Leu Pro Phe Asp Arg Val Leu Leu Asn Glu Gln Gly His Tyr
20 25 30

Asp Pro Thr Thr Gly Lys Phe Thr Cys Gln Val Pro Gly Val Tyr Tyr
35 40 45

Phe Ala Val His Ala Thr Val Tyr Arg Ala Ser Leu Gln Phe Asp Leu
50 55 60

Val Lys Asn Gly Gln Ser Ile Ala Ser Phe Phe Gln Tyr Phe Gly Gly
65 70 75 80

Trp Pro Lys Pro Ala Ser Leu Ser Gly Gly Ala Met Val Arg Leu Glu
85 90 95

Pro Glu Asp Gln Val Trp Val Gln Val Gly Val Gly Asp Tyr Ile Gly
100 105 110

Ile Tyr Ala Ser Ile Lys Thr Asp Ser Thr Phe Ser Gly Phe Leu Val
115 120 125



<210> 15 
<211> 1831 
<212> DNA 

<213> Homo sapiens 



<400> 15

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60
tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120
cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180
cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240
gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 300
gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360
cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420
aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 480
gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 
ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 
ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 
gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 
cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 
ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 840 
gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 
acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 
gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020
tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 
tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 
agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 
catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 
cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 
gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 
cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 
aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 
agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 
atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 
gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 
gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 
tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 
aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 16 
<211> 1218
<212> DNA 

<213> Homo sapiens 
<400> 16 

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 
ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 120 
ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 
aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 240 
gacaccatct ccgggagagt ggatcgtctg gagcgggagg tagactatct ggagacccag 300 
aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 360 
ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 420 
caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 480 
gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagcc 540 
tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 
gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 
tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720 
ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 780 
gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840 
gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 
ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960
gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 1020 
agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080 
gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 
gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 1200
aagaaagagg aggaggtt 1218 

<210> 17 
<211> 406 
<212> PRT 

<213> Homo sapiens 
<400> 17

Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp Ser
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala
65 70 75 80

Asp Thr Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Gln Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val
195 200 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg
210 215 220

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gln
225 230 235 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
245 250 255

Phe Pro Ala Glu Gly Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
260 265 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
275 280 285

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
290 295 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305 310 315 320

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn
325 330 335

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser
340 345 350

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg
355 360 365

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu
370 375 380

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Arg
385 390 395 400

Lys Lys Glu Glu Glu Val
405



<210> 18 
<211> 385 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Gln Gln His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu
1 5 10 15

Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser Ser Arg His Ala Ala
20 25 30

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala
35 40 45

Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala Asp Thr Ile Ser Gly
50 55 60

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gln Asn
65 70 75 80

Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys Val Thr Gly Gly Pro
85 90 95

Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr
100 105 110

Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser Met Lys Ile Leu Lys
115 120 125

Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys Asp Pro Leu Gly Gln
130 135 140

Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln Asn Asp Thr Ala Phe
145 150 155 160

Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala Met Ala Ala Arg Lys
165 170 175

Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val Gly Thr Gly Gln Leu
180 185 190

Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg Pro Pro Gly Arg Pro
195 200 205

Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gln Leu Ile Lys Phe His
210 215 220

Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val Phe Pro Ala Glu Gly
225 230 235 240

Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr Tyr Ile Asp Leu Ala
245 250 255

Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala Thr Arg Glu Asp Asp
260 265 270

Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln Thr Leu Asp Thr Glu
275 280 285

Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn Ala Glu Ala Ala Phe
290 295 300

Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn Thr Arg Pro Ala Ser
305 310 315 320

Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser Gly Thr Leu Thr Pro
325 330 335

Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg Tyr Gly Ala His Ala
340 345 350

Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu Tyr Ala Trp Asp Asp
355 360 365

Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Arg Lys Lys Glu Glu Glu
370 375 380

Val
385



<210> 19 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 19 

Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp Ser
1 5 10 15

Gly Pro Leu Gln Gly
20
<210> 20
<211> 244
<212> PRT

<213> Homo sapiens
<400> 20

Met Leu Leu Leu Gly Ala Val Leu Leu Leu Leu Ala Leu Pro Gly His
1 5 10 15

Asp Gln Glu Thr Thr Thr Gln Gly Pro Gly Val Leu Leu Pro Leu Pro
20 25 30

Lys Gly Ala Cys Thr Gly Trp Met Ala Gly Ile Pro Gly His Pro Gly
35 40 45

His Asn Gly Ala Pro Gly Arg Asp Gly Arg Asp Gly Thr Pro Gly Glu
50 55 60

Lys Gly Glu Lys Gly Asp Pro Gly Leu Ile Gly Pro Lys Gly Asp Ile
65 70 75 80

Gly Glu Thr Gly Val Pro Gly Ala Glu Gly Pro Arg Gly Phe Pro Gly
85 90 95

Ile Gln Gly Arg Lys Gly Glu Pro Gly Glu Gly Ala Tyr Val Tyr Arg
100 105 110

Ser Ala Phe Ser Val Gly Leu Glu Thr Tyr Val Thr Ile Pro Asn Met
115 120 125

Pro Ile Arg Phe Thr Lys Ile Phe Tyr Asn Gln Gln Asn His Tyr Asp
130 135 140

Gly Ser Thr Gly Lys Phe His Cys Asn Ile Pro Gly Leu Tyr Tyr Phe
145 150 155 160

Ala Tyr His Ile Thr Val Tyr Met Lys Asp Val Lys Val Ser Leu Phe
165 170 175

Lys Lys Asp Lys Ala Met Leu Phe Thr Tyr Asp Gln Tyr Gln Glu Asn
180 185 190

Asn Val Asp Gln Ala Ser Gly Ser Val Leu Leu His Leu Glu Val Gly
195 200 205

Asp Gln Val Trp Leu Gln Val Tyr Gly Glu Gly Glu Arg Asn Gly Leu
210 215 220

Tyr Ala Asp Asn Asp Asn Asp Ser Thr Phe Thr Gly Phe Leu Leu Tyr
225 230 235 240

His Asp Thr Asn



<210> 21 
<211> 1721 
<212> DNA 
<213> Mus musculus
<400> 21 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 
ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 
gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 
catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 240 
aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 300 
gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360 
gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 
atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 480
tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540
gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 
cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 
gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720
gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 
gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 
gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 
gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 
cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020
ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 
gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 
gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 
taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 
cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 
tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 
aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440
aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 
aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 
ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 
tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 
aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 22 
<211> 1218 
<212> DNA 

<213> Mus musculus 
<400> 22 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360 

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 420

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 480 

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 
gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200
aagaaggagg aggaagtt 1218 

<210> 23 
<211> 406 
<212> PRT 

<213> Mus musculus 
<400> 23 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala
65 70 75 80

Asp Ser Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Pro Ala Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190

Met Ala Ala Arg Lys Ala Ser Arg Ile Arg Val Pro Phe Pro Trp Val
195 200 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg
210 215 220

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gln
225 230 235 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
245 250 255

Phe Pro Ala Glu Ser Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
260 265 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
275 280 285

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
290 295 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305 310 315 320

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn
325 330 335

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser
340 345 350

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg
355 360 365

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu
370 375 380

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Lys
385 390 395 400

Lys Lys Glu Glu Glu Val
405



<210> 24 
<211> 385 
<212> PRT 

<213> Mus musculus 



<400> 24 

Gln Gln His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu
1 5 10 15

Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser Ser Arg His Ala Ala
20 25 30

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala
35 40 45

Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala Asp Ser Ile Ser Gly
50 55 60

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gln Asn
65 70 75 80

Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys Val Thr Gly Gly Pro
85 90 95

Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr
100 105 110

Asp Cys Ser Tyr Thr Val Ala Gln Val Arg Ser Met Lys Ile Leu Lys
115 120 125

Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys Asp Pro Leu Gly Pro
130 135 140

Ala Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln Asn Asp Thr Ala Phe
145 150 155 160

Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala Met Ala Ala Arg Lys
165 170 175

Ala Ser Arg Ile Arg Val Pro Phe Pro Trp Val Gly Thr Gly Gln Leu
180 185 190

Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg Pro Pro Gly Gly Pro
195 200 205

Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gln Leu Ile Lys Phe His
210 215 220

Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val Phe Pro Ala Glu Ser
225 230 235 240

Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr Tyr Ile Asp Leu Ala
245 250 255

Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala Thr Arg Asp Asp Asp
260 265 270

Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln Thr Leu Asp Thr Glu
275 280 285

Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn Ala Glu Ala Ala Phe
290 295 300

Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn Thr Arg Pro Ala Ser
305 310 315 320

Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser Gly Thr Leu Ala Pro
325 330 335

Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg Tyr Gly Ala His Ala
340 345 350

Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu Tyr Ala Trp Asp Asp
355 360 365

Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Lys Lys Lys Glu Glu Glu
370 375 380

Val
385



<210> 25 
<211> 21 
<212> PRT 

<213> Mus musculus 
<400> 25

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr
1 5 10 15

Gly Pro Leu Gln Gly
20 



<210> 26 

<211> 1869 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> modified_base

<222> all "n" positions 

<223> n=a, c, g, or t 



<400> 26 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60
gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120
cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180
ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240
agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300
cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360
ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420
gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 480
ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540
tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600
tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660
tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720
agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780
gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840
tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900
ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960
tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020
ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080
tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140
cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200
acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260
tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320
ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380
tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440
ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500
acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560
ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620
cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680
tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740
actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800
tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860
ggcggccgc 1869



<210> 27 

<211> 1110 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> modified_base
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 27 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120 

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 480 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 
catccctggt ggggtttttt cctttggctt 1110 

<210> 28 

<211> 370 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid
<400> 28 

Met Ile Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu
1 5 10 15

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gln Leu Gln
20 25 30

Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu Gly Glu Ser
35 40 45

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gln
50 55 60

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln Lys Glu Lys
65 70 75 80

Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr Ser Lys Pro
85 90 95

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg
100 105 110

Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val
115 120 125

Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser Ile Lys Thr
130 135 140

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu
145 150 155 160

Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gln Ser
165 170 175

Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg Gln Leu Pro
180 185 190

Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile Arg Gly Ser
195 200 205

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys
210 215 220

Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val Thr Leu Glu
225 230 235 240

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly
245 250 255

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His
260 265 270

Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp Ile Lys Glu Asp
275 280 285

Ala Ile Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr Ile
290 295 300

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg
305 310 315 320

Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser
325 330 335

Leu Ser Ser Gln Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp
340 345 350

Gly Pro Pro Ser Thr Asn Ile Pro His Pro Trp Trp Gly Phe Phe Leu
355 360 365

Trp Leu
370



<210> 29 
<211> 341 
<212> PRT 

<213> Mus musculus 
<400> 29 

Gln Leu Gln Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu
1 5 10 15

Gly Glu Ser Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser
20 25 30

Ser Ser Gln Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln
35 40 45

Lys Glu Lys Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr
50 55 60

Ser Lys Pro Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu
65 70 75 80

Ser Leu Arg Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser
85 90 95

Cys Ser Val Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser
100 105 110

Ile Lys Thr Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser
115 120 125

Cys Arg Leu Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser
130 135 140

Cys Gln Ser Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg
145 150 155 160

Gln Leu Pro Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile
165 170 175

Arg Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val
180 185 190

Tyr Val Cys Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val
195 200 205

Thr Leu Glu Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala
210 215 220

Val Val Gly Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu
225 230 235 240

Leu Tyr His Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp Ile
245 250 255

Lys Glu Asp Ala Ile Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser
260 265 270

Asp Thr Ile Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg
275 280 285

Ala Leu Arg Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro
290 295 300

Thr Pro Ser Leu Ser Ser Gln Ala Leu Pro Ser Pro Arg His Ala His
305 310 315 320

Asp Arg Trp Gly Pro Pro Ser Thr Asn Ile Pro His Pro Trp Trp Gly
325 330 335

Phe Phe Leu Trp Leu
340



<210> 30 
<211> 29 
<212> PRT 

<213> Mus musculus 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 
<400> 30 

Met Ile Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu
1 5 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala 
20 25 



<210> 31 
<211> 246 
<212> PRT 

<213> Mus musculus 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 
<400> 31 

Met Ile Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu
1 5 10 15

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gln Leu Gln
20 25 30

Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu Gly Glu Ser
35 40 45

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gln
50 55 60

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln Lys Glu Lys
65 70 75 80

Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr Ser Lys Pro
85 90 95

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg
100 105 110

Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val
115 120 125

Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser Ile Lys Thr
130 135 140

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu
145 150 155 160

Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gln Ser
165 170 175

Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg Gln Leu Pro
180 185 190

Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile Arg Gly Ser
195 200 205

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys
210 215 220

Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val Thr Leu Glu
225 230 235 240

Val Ser Thr Gly Pro Gly
245



<210> 32 

<211> 653 

<212> DNA 

<213> Homo sapiens 

<400> 32 

ttttttgcat gtaacttttt tattgaggca caacaaggca ttgtaacttg cctggacttg 60 

aggcagtcag tttagtaagc tgaacgttaa tacagttaag gattaagtgc aaacaatata 120 

cattcacagc ttgactagcg aggctacatc acaatttata aagtgccaga ttagtgctaa 180 

ttgtcattca gcttgatttt tcacctcagg aaggaaaaca aaaaagtaag gacctcctcc 240 

ctctaggaac aaaaaacatt ttcctaaacc aatcagtcat gagggcaaag actacttttc 300 

cttcaatccc actaattaga acaccatcct tttattgtca atactgtact gactttcaat 360 

cttgataaag aagatagcct gaaaacgtag aatatttcca gctacttcca taaattgctc 420 

ccctgtgcag acgtaaccat atctggtctc cctggaagag ctgaagaatt gcatgattgc 480 

tagcagtttc atggtctgga gcaccatcat tggcataggc tgataccaag acctcttcat 540 

tcttcantga ggttgacata cagtggcaca ttcactgcca gcttttacat gtgaaaaatg 600 

aaaaacgtag tgccattcac ttggcaatta aatctaccaa agctgagatc aaa 653

<210> 33 

<211> 25 

<212> PRT 

<213> Mus musculus 

<400> 33 

Ala Ala Val Val Ala Glu Ala Val Val Gly Thr Leu Val Gly Leu Gly 
1 5 10 15

Leu Leu Ala Gly Leu Val Leu Leu Tyr 
20 25 
<210> 34
<211> 99 
<212> PRT 

<213> Mus musculus 
<400> 34 

His Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp Ile Lys Glu
1 5 10 15

Asp Ala Ile Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr
20 25 30

Ile Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu
35 40 45

Arg Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro
50 55 60

Ser Leu Ser Ser Gln Ala Leu Pro Ser Pro Arg His Ala His Asp Arg
65 70 75 80

Trp Gly Pro Pro Ser Thr Asn Ile Pro His Pro Trp Trp Gly Phe Phe
85 90 95

Leu Trp Leu



<210> 35 
<211> 80 
<212> PRT 

<213> Mus musculus 
<400> 35 

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gln
1 5 10 15

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln Lys Glu Lys
20 25 30

Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr Ser Lys Pro
35 40 45

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg
50 55 60

Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val
65 70 75 80



<210> 36 

<211> 60 

<212> PRT 

<213> Mus musculus 
<400> 36

Gly Ala Asn Val Thr Leu Ser Cys Gln Ser Pro Arg Ser Lys Pro Ala
1 5 10 15

Val Gln Tyr Gln Trp Asp Arg Gln Leu Pro Ser Phe Gln Thr Phe Phe
20 25 30

Ala Pro Ala Leu Asp Val Ile Arg Gly Ser Leu Ser Leu Thr Asn Leu
35 40 45

Ser Ser Ser Met Ala Gly Val Tyr Val Cys Lys Ala
50 55 60



<210> 37 
<211> 1846 
<212> DNA 

<213> Mus musculus 
<400> 37 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60
caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120
tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180
cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240
agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300
ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360
ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420
ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480
gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540
caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600
tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660
tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720
agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780
tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840
gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900
gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960
caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020
cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080
caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140
cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200
cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260
tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320
tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380
ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440
tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500
ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560
gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620
ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680
tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740
gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800
agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 1846

<210> 38
<211> 1182
<212> DNA

<213> Mus musculus
<400> 38

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60
agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120
aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180
gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 240
gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300
acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360
gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420
ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480
tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540
aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600
tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660
atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720
accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780
tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840
ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900
accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960
gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020
gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080
gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 1140
cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182



<210> 39 
<211> 394 
<212> PRT 

<213> Mus musculus 
<400> 39 

Met Ile Leu Gln Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu
1 5 10 15

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gln Met Glu
20 25 30

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu
35 40 45

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp
50 55 60

Ser His Pro Arg Glu Val Pro Ile Leu Ile Trp Phe Leu Glu Gln Glu
65 70 75 80

Gly Lys Glu Pro Asn Gln Val Leu Ser Tyr Ile Asn Gly Val Met Thr
85 90 95

Asn Lys Pro Gly Thr Ala Leu Val His Ser Ile Ser Ser Arg Asn Val
100 105 110

Ser Leu Arg Leu Gly Ala Leu Gln Glu Gly Asp Ser Gly Thr Tyr Arg
115 120 125

Cys Ser Val Asn Val Gln Asn Asp Glu Gly Lys Ser Ile Gly His Ser
130 135 140

Ile Lys Ser Ile Glu Leu Lys Val Leu Val Pro Pro Ala Pro Pro Ser
145 150 155 160

Cys Ser Leu Gln Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn
165 170 175

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gln Tyr Gln Trp Glu Arg
180 185 190

Leu Ala Pro Ser Ser Gln Val Phe Phe Gly Pro Ala Leu Asp Ala Val
195 200 205

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser Ile Ala Met Ser Gly Val
210 215 220

Tyr Val Cys Lys Ala Gln Asn Arg Val Gly Phe Ala Lys Cys Asn Val
225 230 235 240

Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val Ala Gly Ala
245 250 255

Val Val Gly Thr Phe Val Gly Leu Val Leu Ile Ala Gly Leu Val Leu
260 265 270

Leu Tyr Gln Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp Ile
275 280 285

Lys Glu Asp Ala Ile Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser
290 295 300

Asp Thr Ile Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg
305 310 315 320

Ala Leu Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly Thr Phe Thr
325 330 335

Pro Thr Pro Ser Val Ser Ser Gln Ala Leu Ser Ser Pro Arg Leu Pro
340 345 350

Arg Val Asp Glu Pro Pro Pro Gln Ala Val Ser Leu Thr Pro Gly Gly
355 360 365

Val Ser Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro Val Met Val
370 375 380

Pro Ala Gln Ser Gln Ala Gly Ser Leu Val
385 390



<210> 40 
<211> 365 
<212> PRT 

<213> Mus musculus 
<400> 40 

Gln Met Glu Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val
1 5 10 15

Glu Gly Glu Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu
20 25 30

Glu Ser Trp Ser His Pro Arg Glu Val Pro Ile Leu Ile Trp Phe Leu
35 40 45

Glu Gln Glu Gly Lys Glu Pro Asn Gln Val Leu Ser Tyr Ile Asn Gly
50 55 60

Val Met Thr Asn Lys Pro Gly Thr Ala Leu Val His Ser Ile Ser Ser
65 70 75 80

Arg Asn Val Ser Leu Arg Leu Gly Ala Leu Gln Glu Gly Asp Ser Gly
85 90 95

Thr Tyr Arg Cys Ser Val Asn Val Gln Asn Asp Glu Gly Lys Ser Ile
100 105 110

Gly His Ser Ile Lys Ser Ile Glu Leu Lys Val Leu Val Pro Pro Ala
115 120 125

Pro Pro Ser Cys Ser Leu Gln Gly Val Pro Tyr Val Gly Thr Asn Val
130 135 140

Thr Leu Asn Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gln Tyr Gln
145 150 155 160

Trp Glu Arg Leu Ala Pro Ser Ser Gln Val Phe Phe Gly Pro Ala Leu
165 170 175

Asp Ala Val Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser Ile Ala Met
180 185 190

Ser Gly Val Tyr Val Cys Lys Ala Gln Asn Arg Val Gly Phe Ala Lys
195 200 205

Cys Asn Val Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val
210 215 220

Ala Gly Ala Val Val Gly Thr Phe Val Gly Leu Val Leu Ile Ala Gly
225 230 235 240

Leu Val Leu Leu Tyr Gln Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala
245 250 255

Asn Asp Ile Lys Glu Asp Ala Ile Ala Pro Arg Thr Leu Pro Trp Thr
260 265 270

Lys Gly Ser Asp Thr Ile Ser Lys Asn Gly Thr Leu Ser Ser Val Thr
275 280 285

Ser Ala Arg Ala Leu Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly
290 295 300

Thr Phe Thr Pro Thr Pro Ser Val Ser Ser Gln Ala Leu Ser Ser Pro
305 310 315 320

Arg Leu Pro Arg Val Asp Glu Pro Pro Pro Gln Ala Val Ser Leu Thr
325 330 335

Pro Gly Gly Val Ser Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro
340 345 350

Val Met Val Pro Ala Gln Ser Gln Ala Gly Ser Leu Val
355 360 365



<210> 41 
<211> 29 
<212> PRT 

<213> Mus musculus 
<400> 41 

Met Ile Leu Gln Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu
1 5 10 15

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala 
20 25 



<210> 42 
<211> 249 
<212> PRT 

<213> Mus musculus 
<400> 42 

Met Ile Leu Gln Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu
1 5 10 15

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gln Met Glu
20 25 30

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu
35 40 45

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp
50 55 60

Ser His Pro Arg Glu Val Pro Ile Leu Ile Trp Phe Leu Glu Gln Glu
65 70 75 80

Gly Lys Glu Pro Asn Gln Val Leu Ser Tyr Ile Asn Gly Val Met Thr
85 90 95

Asn Lys Pro Gly Thr Ala Leu Val His Ser Ile Ser Ser Arg Asn Val
100 105 110

Ser Leu Arg Leu Gly Ala Leu Gln Glu Gly Asp Ser Gly Thr Tyr Arg
115 120 125

Cys Ser Val Asn Val Gln Asn Asp Glu Gly Lys Ser Ile Gly His Ser
130 135 140

Ile Lys Ser Ile Glu Leu Lys Val Leu Val Pro Pro Ala Pro Pro Ser
145 150 155 160

Cys Ser Leu Gln Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn
165 170 175

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gln Tyr Gln Trp Glu Arg
180 185 190

Leu Ala Pro Ser Ser Gln Val Phe Phe Gly Pro Ala Leu Asp Ala Val
195 200 205

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser Ile Ala Met Ser Gly Val
210 215 220

Tyr Val Cys Lys Ala Gln Asn Arg Val Gly Phe Ala Lys Cys Asn Val
225 230 235 240

Thr Leu Asp Val Met Thr Gly Ser Lys
245



<210> 43 
<211> 355 
<212> PRT 

<213> Mus musculus 
<220> 

<221> SITE 
<222> (355) 

<223> Xaa=unknown amino acid 
<400> 43 

Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp Ser
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala
65 70 75 80

Asp Thr Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Gln Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190



Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val
195 200 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg
210 215 220

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gln
225 230 235 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
245 250 255

Phe Pro Ala Glu Gly Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
260 265 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
275 280 285

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
290 295 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305 310 315 320

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn
325 330 335

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser
340 345 350

Gly Pro Xaa
355



<210> 44 
<211> 25 
<212> PRT 

<213> Mus musculus 
<400> 44 

Ala Ala Val Val Ala Gly Ala Val Val Gly Thr Phe Val Gly Leu Val 
1 5 10 15

Leu Ile Ala Gly Leu Val Leu Leu Tyr
20 25 



<210> 45 
<211> 120 
<212> PRT 

<213> Mus musculus 
<400> 45 

Gln Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp Ile Lys Glu
1 5 10 15

Asp Ala Ile Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser Asp Thr
20 25 30

Ile Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu
35 40 45

Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly Thr Phe Thr Pro Thr
50 55 60

Pro Ser Val Ser Ser Gln Ala Leu Ser Ser Pro Arg Leu Pro Arg Val
65 70 75 80

Asp Glu Pro Pro Pro Gln Ala Val Ser Leu Thr Pro Gly Gly Val Ser
85 90 95

Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro Val Met Val Pro Ala
100 105 110

Gln Ser Gln Ala Gly Ser Leu Val
115 120



<210> 46 

<211> 1801 

<212> DNA 

<213> Homo sapiens 



<400> 46 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60
cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120
tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180
gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240
tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300
ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360
gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420
agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480
ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540
caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600
cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660
agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720
agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780
tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840
catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900
ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960
gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020
aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080
acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 1140
ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200
cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260
tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320
acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380
actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440
tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500
tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560
acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620
gggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680
tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740
ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800
1801

<210> 47
<211> 735
<212> DNA

<213> Homo sapiens
<400> 47

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60
tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120
tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180
tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240
atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300
ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360
accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420
acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480
accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540
tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600
gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660
tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720
ccaaaggcag ttccc 735



<210> 48 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 48 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro
1 5 10 15

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro
20 25 30

Ser Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn
35 40 45

Ser Ile Pro Glu Ser Cys Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser
50 55 60

Gln Tyr Cys Cys Ser Asp Val Leu Lys Lys Ile Gln Trp Asn Glu Glu
65 70 75 80

Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Ala His Pro Glu Thr
85 90 95

Pro Glu Gln Leu Gly Ser Ala Leu Lys Tyr Gln Ser Ser Leu Asp Ser
100 105 110

Asp Asn Met Pro Gly Phe Gly Ala Thr Val Ala Ile Gly Leu Thr Val
115 120 125

Phe Val Val Phe Ile Ala Thr Ile Ile Val Cys Phe Thr Cys Ser Cys
130 135 140

Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn
145 150 155 160

Thr Thr Thr Thr Thr Val Val His Thr Ala Tyr Pro Gln Pro Gln Pro
165 170 175

Val Ala Pro Ser Tyr Pro Gly Pro Thr Tyr Gln Gly Tyr His Pro Met
180 185 190

Pro Pro Gln Pro Gly Met Pro Ala Ala Pro Tyr Pro Thr Gln Tyr Pro
195 200 205

Pro Pro Tyr Leu Ala Gln Pro Thr Gly Pro Pro Ala Tyr His Glu Thr
210 215 220

Leu Ala Gly Ala Ser Gln Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro
225 230 235 240

Pro Lys Ala Val Pro
245



<210> 49 
<211> 38 
<212> PRT 

<213> Homo sapiens 
<400> 49 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
1 5 10 15

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro 
20 25 30 

Ser Pro Gly Ala His Gly 
35 



<210> 50 
<211> 207 
<212> PRT 

<213> Homo sapiens 
<400> 50 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser Ile Pro Glu Ser Cys
1 5 10 15

Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser Gln Tyr Cys Cys Ser Asp
20 25 30

Val Leu Lys Lys Ile Gln Trp Asn Glu Glu Met Cys Pro Glu Pro Glu
35 40 45

Ser Ser Arg Phe Ser Ala His Pro Glu Thr Pro Glu Gln Leu Gly Ser
50 55 60

Ala Leu Lys Tyr Gln Ser Ser Leu Asp Ser Asp Asn Met Pro Gly Phe
65 70 75 80

Gly Ala Thr Val Ala Ile Gly Leu Thr Val Phe Val Val Phe Ile Ala
85 90 95

Thr Ile Ile Val Cys Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met
100 105 110

Cys Cys Arg Pro Arg Pro Val Val Ser Asn Thr Thr Thr Thr Thr Val
115 120 125

Val His Thr Ala Tyr Pro Gln Pro Gln Pro Val Ala Pro Ser Tyr Pro
130 135 140

Gly Pro Thr Tyr Gln Gly Tyr His Pro Met Pro Pro Gln Pro Gly Met
145 150 155 160

Pro Ala Ala Pro Tyr Pro Thr Gln Tyr Pro Pro Pro Tyr Leu Ala Gln
165 170 175

Pro Thr Gly Pro Pro Ala Tyr His Glu Thr Leu Ala Gly Ala Ser Gln
180 185 190

Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro Pro Lys Ala Val Pro
195 200 205



<210> 51 

<211> 85 

<212> PRT 

<213> Homo sapiens 

<400> 51 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser Ile Pro Glu Ser Cys
1 5 10 15

Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser Gln Tyr Cys Cys Ser Asp
20 25 30

Val Leu Lys Lys Ile Gln Trp Asn Glu Glu Met Cys Pro Glu Pro Glu
35 40 45

Ser Ser Arg Phe Ser Ala His Pro Glu Thr Pro Glu Gln Leu Gly Ser
50 55 60

Ala Leu Lys Tyr Gln Ser Ser Leu Asp Ser Asp Asn Met Pro Gly Phe
65 70 75 80 

Gly Ala Thr Val Ala 
85 



<210> 52 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 52 

Ile Gly Leu Thr Val Phe Val Val Phe Ile Ala Thr Ile Ile Val Cys
1 5 10 15

Phe Thr Cys Ser Cys Cys Cys Leu Tyr
20 25

<210> 53
<211> 97 
<212> PRT 

<213> Homo sapiens 
<400> 53 

Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn Thr Thr Thr Thr
1 5 10 15

Thr Val Val His Thr Ala Tyr Pro Gln Pro Gln Pro Val Ala Pro Ser
20 25 30

Tyr Pro Gly Pro Thr Tyr Gln Gly Tyr His Pro Met Pro Pro Gln Pro
35 40 45

Gly Met Pro Ala Ala Pro Tyr Pro Thr Gln Tyr Pro Pro Pro Tyr Leu
50 55 60

Ala Gln Pro Thr Gly Pro Pro Ala Tyr His Glu Thr Leu Ala Gly Ala
65 70 75 80

Ser Gln Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro Pro Lys Ala Val
85 90 95 

Pro 



<210> 54 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 54 

Cys Arg Pro Phe Gly Glu Asp Asn Ser Ile Pro Glu Ser Cys Pro Asp
1 5 10 15

Phe Cys Cys Gly Ser Cys Ser Ser Gln Tyr Cys Cys Ser Asp Val Leu
20 25 30

Lys Lys Ile Gln Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser
35 40 45 

Arg Phe 
50 



<210> 55 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 55 

Thr Val Phe Val Val Phe Ile Ala Thr Ile Ile Val Cys Phe Thr Cys
1 5 10 15

Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val
20 25 30

Ser Asn Thr Thr Thr Thr Thr Val Val His Thr Ala Tyr Pro Gln Pro
35 40 45

Gln Pro Val Ala Pro Ser Tyr Pro
50 55 



<210> 56 

<211> 1858 

<212> DNA 

<213> Mus musculus 



<400> 56 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60
tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120
tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180
cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240
ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300
ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360
cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420
tcgccattgg cgtgaccatc tttgtggtgt ttattgccac tatcatcatc tgcttcacct 480
gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540
caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 600
gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660
gcagcaccct acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 720
ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780
tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840
gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900
catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960
caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020
aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080
taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 1140
gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200
caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260
tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320
tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380
cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440
ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500
ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560
gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620
gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680
ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740
tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800
tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858



<210> 57 

<211> 639 

<212> DNA 

<213> Mus musculus 



<400> 57 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60
cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120
ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180
aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240
cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300
agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360
tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420
tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 480
cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540
catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600
accctacctg gcccagccca cagggccgcc accctacca 639

<210> 58 
<211> 213 
<212> PRT 

<213> Mus musculus 
<400> 58 

Met Ala Ala Pro Ala Pro Ser Leu Trp Thr Leu Leu Leu Leu Leu Leu
1 5 10 15

Leu Leu Pro Pro Pro Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe
20 25 30

Gly Glu Asp Asn Ser Ile Pro Val Phe Cys Pro Asp Phe Cys Cys Gly
35 40 45

Ser Cys Ser Asn Gln Tyr Cys Cys Ser Asp Val Leu Arg Lys Ile Gln
50 55 60

Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Thr
65 70 75 80

Pro Ala Glu Glu Thr Pro Glu His Leu Gly Ser Ala Leu Lys Phe Arg
85 90 95

Ser Ser Phe Asp Ser Asp Pro Met Ser Gly Phe Gly Ala Thr Val Ala
100 105 110

Ile Gly Val Thr Ile Phe Val Val Phe Ile Ala Thr Ile Ile Ile Cys
115 120 125

Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gln Arg
130 135 140

Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Val His Ala Pro Tyr
145 150 155 160

Pro Gln Pro Gln Pro Gln Pro Val Ala Pro Ser Tyr Pro Gly Pro Thr
165 170 175

Tyr Gln Gly Tyr His Pro Met Pro Pro Pro Ala Arg Asn Ala Ser Ser
180 185 190

Thr Leu Pro Asn Ala Val Pro Thr Thr Leu Pro Gly Pro Ala His Arg
195 200 205

Ala Ala Thr Leu Pro 
210 



<210> 59 
<211> 26
<212> PRT

<213> Mus musculus
<400> 59

Met Ala Ala Pro Ala Pro Ser Leu Trp Thr Leu Leu Leu Leu Leu Leu
1 5 10 15

Leu Leu Pro Pro Pro Pro Gly Ala His Gly
20 25



<210> 60 

<211> 187 

<212> PRT 

<213> Mus musculus 

<400> 60 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser Ile Pro Val Phe Cys
1 5 10 15

Pro Asp Phe Cys Cys Gly Ser Cys Ser Asn Gln Tyr Cys Cys Ser Asp
20 25 30

Val Leu Arg Lys Ile Gln Trp Asn Glu Glu Met Cys Pro Glu Pro Glu
35 40 45

Ser Ser Arg Phe Ser Thr Pro Ala Glu Glu Thr Pro Glu His Leu Gly
50 55 60

Ser Ala Leu Lys Phe Arg Ser Ser Phe Asp Ser Asp Pro Met Ser Gly
65 70 75 80

Phe Gly Ala Thr Val Ala Ile Gly Val Thr Ile Phe Val Val Phe Ile
85 90 95

Ala Thr Ile Ile Ile Cys Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys
100 105 110

Met Cys Cys Pro Gln Arg Pro Val Val Thr Asn Thr Thr Thr Thr Thr
115 120 125

Val Val His Ala Pro Tyr Pro Gln Pro Gln Pro Gln Pro Val Ala Pro
130 135 140

Ser Tyr Pro Gly Pro Thr Tyr Gln Gly Tyr His Pro Met Pro Pro Pro
145 150 155 160

Ala Arg Asn Ala Ser Ser Thr Leu Pro Asn Ala Val Pro Thr Thr Leu
165 170 175

Pro Gly Pro Ala His Arg Ala Ala Thr Leu Pro 
180 185 



<210> 61 

<211> 86 

<212> PRT 

<213> Mus musculus
<400> 61

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser Ile Pro Val Phe Cys
1 5 10 15

Pro Asp Phe Cys Cys Gly Ser Cys Ser Asn Gln Tyr Cys Cys Ser Asp
20 25 30

Val Leu Arg Lys Ile Gln Trp Asn Glu Glu Met Cys Pro Glu Pro Glu
35 40 45

Ser Ser Arg Phe Ser Thr Pro Ala Glu Glu Thr Pro Glu His Leu Gly
50 55 60

Ser Ala Leu Lys Phe Arg Ser Ser Phe Asp Ser Asp Pro Met Ser Gly
65 70 75 80

Phe Gly Ala Thr Val Ala 
85 



<210> 62 

<211> 25 

<212> PRT 

<213> Mus musculus 



<400> 62 

Ile Gly Val Thr Ile Phe Val Val Phe Ile Ala Thr Ile Ile Ile Cys
1 5 10 15

Phe Thr Cys Ser Cys Cys Cys Leu Tyr 
20 25 



<210> 63 

<211> 76 

<212> PRT 

<213> Mus musculus 



<400> 63 

Lys Met Cys Cys Pro Gln Arg Pro Val Val Thr Asn Thr Thr Thr Thr
1 5 10 15

Thr Val Val His Ala Pro Tyr Pro Gln Pro Gln Pro Gln Pro Val Ala
20 25 30

Pro Ser Tyr Pro Gly Pro Thr Tyr Gln Gly Tyr His Pro Met Pro Pro
35 40 45 

Pro Ala Arg Asn Ala Ser Ser Thr Leu Pro Asn Ala Val Pro Thr Thr 
50 55 60 

Leu Pro Gly Pro Ala His Arg Ala Ala Thr Leu Pro 
65 70 75 



<210> 64 
<211> 50 
<212> PRT

<213> Mus musculus
<400> 64

Cys Pro Asp Phe Cys Cys Gly Ser Cys Ser Asn Gln Tyr Cys Cys Ser
1 5 10 15

Asp Val Leu Arg Lys Ile Gln Trp Asn Glu Glu Met Cys Pro Glu Pro
20 25 30

Glu Ser Ser Arg Phe Ser Thr Pro Ala Glu Glu Thr Pro Glu His Leu 
35 40 45 

Gly Ser 
50 



<210> 65 
<211> 56 
<212> PRT 

<213> Mus musculus 
<400> 65 

Cys Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gln
1 5 10 15

Arg Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Val His Ala Pro
20 25 30

Tyr Pro Gln Pro Gln Pro Gln Pro Val Ala Pro Ser Tyr Pro Gly Pro
35 40 45

Thr Tyr Gln Gly Tyr His Pro Met
50 55 



<210> 66 

<211> 1927 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> modified_base

<222> all "n" positions 

<223> n=a, c, g, or t 



<400> 66 

ccccgcctcc aaagctaacc ctcgggcttg aggggaagan gctgactgta cgttccttct 60 
actctggcac cactctccag gctgccatgg ggcccagcac ccctctcctc atcttgttcc 120 
ttttgtcatg gtcgggaccc ctccaaggac agcagcacca ccttgtggag tacatggaac 180 
gccgactagc tgctttagag gaacggctgg cccagtgcca ggaccagagt agtcggcatg 240 
ctgctgagct gcgggacttc aagaacaaga tgctngccac tgctggaggt ggcagagaag 300 
gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360
cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420
aaggttgact ggaggccctg ggaccaaagg caagggaaga aggaatgaga agtacgatat 480
ggtgacagac tgtggctaca caatctctca agtgagatca atgaagattc tgaagcgatt 540 
tggtggccca gctggtctat ggaccaagga tccactgggg caaacagaga agatctacgt 600 
gttagatggg acacagaatg acacagcctt tgtcttccca aggctgcgtg acttcaccct 660 
tgccatggct gcccggaaag cttcccgagt ccgggtgccc ttcccctggg taggcacagg 720
gcagctggta tatggtggct ttctttattt tgctcggagg cctcctggaa gacctggtgg 780

aggtggtgag atggagaaca ctttgcagct aatcaaattc cacctggcaa accgaacagt 840

ggtggacagc tcagtattcc cagcagaggg gctgatcccc ccctacggct tgacagcaga 900 

cacctacatc gacctggcag ctgatgagga aggtctttgg gctgtctatg ccacccggga 960 

ggatgacagg cacttgtgtc tggccaagtt agatccacag acactggaca cagagcagca 1020 

gtgggacaca ccatgtccca gagagaatgc tgaggctgcc tttntcatct gtgggaccct 1080 

ctatgtcgtc tataacaccc gtcctgccag tcgggcccgc atccagtgct cctttgatgc 1140 

cagcggaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatcgctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctcaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaaagg aaaatccana aaaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1860 

aaactgcggc cgctgtccct tctgtcgtct tctcgcagcc gtacccttct gtcgtcttct 1920 

cgcagcc 1927 

<210> 67 
<211> 319 
<212> PRT 

<213> Homo sapiens 
<400> 67 

Met Val Gly Lys Met Trp Pro Val Leu Trp Thr Leu Cys Ala Val Arg
1 5 10 15

Val Thr Val Asp Ala Ile Ser Val Glu Thr Pro Gln Asp Val Leu Arg
20 25 30

Ala Ser Gln Gly Lys Ser Val Thr Leu Pro Cys Thr Tyr His Thr Ser
35 40 45

Thr Ser Ser Arg Glu Gly Leu Ile Gln Trp Asp Lys Leu Leu Leu Thr
50 55 60

His Thr Glu Arg Val Val Ile Trp Pro Phe Ser Asn Lys Asn Tyr Ile
65 70 75 80

His Gly Glu Leu Tyr Lys Asn Arg Val Ser Ile Ser Asn Asn Ala Glu
85 90 95

Gln Ser Asp Ala Ser Ile Thr Ile Asp Gln Leu Thr Met Ala Asp Asn
100 105 110

Gly Thr Tyr Glu Cys Ser Val Ser Leu Met Ser Asp Leu Glu Gly Asn
115 120 125

Thr Lys Ser Arg Val Arg Leu Leu Val Leu Val Pro Pro Ser Lys Pro
130 135 140

Glu Cys Gly Ile Glu Gly Glu Thr Ile Ile Gly Asn Asn Ile Gln Leu
145 150 155 160

Thr Cys Gln Ser Lys Glu Gly Ser Pro Thr Pro Gln Tyr Ser Trp Lys
165 170 175

Arg Tyr Asn Ile Leu Asn Gln Glu Gln Pro Leu Ala Gln Pro Ala Ser
180 185 190

Gly Gln Pro Val Ser Leu Lys Asn Ile Ser Thr Asp Thr Ser Gly Tyr
195 200 205

Tyr Ile Cys Thr Ser Ser Asn Glu Glu Gly Thr Gln Phe Cys Asn Ile
210 215 220

Thr Val Ala Val Arg Ser Pro Ser Met Asn Val Ala Leu Tyr Val Gly
225 230 235 240

Ile Ala Val Gly Val Val Ala Ala Leu Ile Ile Ile Gly Ile Ile Ile
245 250 255

Tyr Cys Cys Cys Cys Arg Gly Lys Asp Asp Asn Thr Glu Asp Lys Glu
260 265 270

Asp Ala Arg Pro Asn Arg Glu Ala Tyr Glu Glu Pro Pro Glu Gln Leu
275 280 285

Arg Glu Leu Ser Arg Glu Arg Glu Glu Glu Asp Asp Tyr Arg Gln Glu
290 295 300

Glu Gln Arg Ser Thr Gly Arg Glu Ser Pro Asp His Leu Asp Gln
305 310 315



<210> 68 
<211> 2793 
<212> DNA 

<213> Homo sapiens 
<400> 68 

ctaccccttt gtgagcagtc taggactttg tacacctgtt aagtagggag aaggcagggg 60 
aggtggctgg tttaagggga acttgaggga agtagggaag actcctcttg ggacctttgg 120
agtaggtgac acatgagccc agccccagct cacctgccaa tccagctgag gagctcacct 180 
gccaatccag ctgaggctgg gcagaggtgg gtgagaagag ggaaaattgc agggacctcc 240 
agttgggcca ggccagaagc tgctgtagct ttaaccagac agctcagacc tgtctggagg 300
ctgccagtga caggttaggt ttagggcaga gaagaagcaa gaccatggtg gggaagatgt 360
ggcctgtgtt gtggacactc tgtgcagtca gggtgaccgt cgatgccatc tctgtggaaa 420 
ctccgcagga cgttcttcgg gcttcgcagg gaaagagtgt caccctgccc tgcacctacc 480 
acacttccac ctccagtcga gagggactta ttcaatggga taagctcctc ctcactcata 540 
cggaaagggt ggtcatctgg ccgttttcaa acaaaaacta catccatggt gagctttata 600 
agaatcgcgt cagcatatcc aacaatgctg agcagtccga tgcctccatc accattgatc 660 
agctgaccat ggctgacaac ggcacctacg agtgttctgt ctcgctgatg tcagacctgg 720
agggcaacac caagtcacgt gtccgcctgt tggtcctcgt gccaccctcc aaaccagaat 780
gcggcatcga gggagagacc ataattggga acaacatcca gctgacctgc caatcaaagg 840
agggctcacc aacccctcag tacagctgga agaggtacaa catcctgaat caggagcagc 900 
ccctggccca gccagcctca ggtcagcctg tctccctgaa gaatatctcc acagacacat 960 
cgggttacta catctgtacc tccagcaatg aggaggggac gcagttctgc aacatcacgg 1020
tggccgtcag atctccctcc atgaacgtgg ccctgtatgt gggcatcgcg gtgggcgtgg 1080 
ttgcagccct cattatcatt ggcatcatca tctactgctg ctgctgccga gggaaggacg 1140 
acaacactga agacaaggag gatgcaaggc cgaaccggga agcctatgag gagccaccag 1200 
agcagctaag agaactttcc agagagaggg aggaggagga tgactacagg caagaagagc 1260
agaggagcac tgggcgtgaa tccccggacc acctcgacca gtgacaggcc agcagcagag 1320
ggcggcggag gaagggttag gggttcattc tcccgcttcc tggcctccct tctcctttct 1380
aagccctgtt ctcctgtccc tccatcccag acattgatgg ggacatttct tccccagtgt 1440
cagctgtggg gaacatggct ggcctggtaa gggggtccct gtgctgatcc tgctgacctc 1500
actgtcctgt gaagtaaccc ctcctggctg tgacacctgg tgcgggcctg gccctcactc 1560
aagaccaggc tgcagcctcc acttccctcg tagttggcag gagctcctgg aagcacagcg 1620
ctgagcatgg ggcgctccca ctcagaactc tccagggagg cgatgccagc cttggggggt 1680
gggggctgtc ctgctcacct gtgtgcccag cacctggagg ggcaccaggt ggagggtttg 1740
cactccacac atctttcttg aatgaatgaa agaataagtg agtatgcttg ggccctgcat 1800
tggcctggcc tccagctccc actccctttc caacctcact tcccgtagct gccagtatgt 1860
tccaaaccct cctgggaagg ccacctccca ctcctgctgc acaggccctg gggagctttt 1920
gcccacacac tttccatctc tgcctgtcaa tatcgtacct gtccctccag gcccatctca 1980
aatcacaagg atttctctaa ccctatccta attgtccaca tacgtggaaa caatcctgtt 2040
actctgtccc acgtccaatc atgggccaca aggcacagtc ttctgagcga gtgctctcac 2100
tgtattagag cgccagctcc ttggggcagg gcctgggcct catggctttt gctttccctg 2160
aagccctagt agctggcgcc catcctagtg ggcacttaag cttaattggg gaaactgctt 2220
tgattggttg tgccttccct tctctggtct ccttgagatg atcgtagaca cagggatgat 2280
tcccacccaa acccacgtat tcattcagtg agttaaacac gaattgattt aaagtgaaca 2340
cacacaaggg agcttgcttg cagatggtct gagttcttgt gtcctggtaa ttcctctcca 2400
ggccagaata attggcatgt ctcctcaacc cacatggggt tcctggttgt tcctgcatcc 2460
cgatacctca gccctggccc tgcccagccc atttgggctc tggttttctg gtggggctgt 2520
cctgctgccc tcccacagcc tccttctgtt tgtcgagcat ttcttctact cttgagagct 2580
caggcagcgt tagggctgct taggtctcat ggaccagtgg ctggtctcac ccaactgcag 2640
tttactattg ctatcttttc tggatgatca gaaaaataat tccataaatc tattgtctac 2700
ttgcgatttt ttaaaaaatg tatattttta tatatattgt taaatccttt gcttcattcc 2760
aaatgctttc agtaataata aaattgtggg tgg 2793



<210> 69 
<211> 52 
<212> PRT 

<213> Homo sapiens 
<400> 69 

Lys Thr Ala Leu Gly Glu Leu Leu Lys Pro Leu Asn Ser Glu Tyr Gly 
1 5 10 15 

Lys Val Ala Pro Gly Trp Gly Thr Thr Pro Leu Met Gly Val Phe Met 
20 25 30 

Ala Leu Phe Ala Val Phe Leu Leu Ile Ile Leu Glu Ile Tyr Asn Ser
35 40 45 



Ser Val Leu Leu 
50 



<210> 70 

<211> 1832 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> modified_base

<222> all "n" positions 

<223> n=a, c, g, or t 

<400> 70 

tgtggctgac gtcatctgga ggagatttgc tttctttttc tccaaaaggg gaggaaattg 60
aaactgcagt ggcccacgat gggaagaggg gaaagcccag gggtacagga ggcctctggg 120
tgaaggcaga ggctaacatg gggttcggag cgaccttggc cgttggctga ccatctttgt 180 
gctgtctgtc gtcactatca tcatctgctt cacctgctcc tgctgctgcc tttacaagac 240 
gtgccgccga ccacgtccgg ttgtcaccac caccacatcc accactgtgg tgcatgcccc 300 
ttatcctcag cctccaagtg tgccgcccag ctaccctgga ccaagctacc agggctacca 360 
caccatgccg cctcagccag ggatgccagc agcaccctac ccaatgcagt acccaccacc 420 
ttacccagcc cagcccatgg gcccaccggc ctaccacgag accctggctg gaggagcagc 480
cgcgccctam cccgscagcc agcctcctta caacccggcc tacatggatg cccgaagcgg 540 
ccctctgagc attccctggc ctctytggct gccacttggt tatgttgtgt gtgtgcgtra 600 
gtggtgtgca ggcgcggttc cttacgcccc atgtgtgctg tgtgtgtcca ggcacggttc 660 
cttacgcccc atgtgtgctg tgtgtgtcct gcctgtatat gtggcttcct ctgatgctga 720
caagtgggga acaatccttg ccagagtggg ctgggaccag actttgttct cttcctcacc 780
tgaaattatg cttcctaaaa tctcaagcca aactcaaaga atggggtggt ggggggcacc 840
ctgtgaggtg gcccctgaga ggtgggggcc tctccagggc acatctggag ttcttctcca 900 
gcttacccta gggtgaccaa gtagggcctg tcacaccagg gtggcgcast ttctgtgtga 960 
tgcagatgtg tcctggtttc ggcagcgtag ccagctgctg cttgaggcca tggctcgtcc 1020 
ccggagttgg gggtacccgt tgcagagcca gggacatgat gcaggcgaag yttgggatct 1080 
ggccaagttg gactttgatc ctttgggcag atgtcccatt gctccctgga gcctgtcatg 1140 
cctgttgggg atcaggcagc ctcctgatgc cagaacacct caggcagagc cctactcagc 1200 
tgtacctgtc tgcctggact gtcccctgtc cccgcatctc ccctgggacc agctggaggg 1260 
ccacatgcac acacagccta gctgccccca gggagctctg ctgcccttgc tggccctgcc 1320 
cttcccacag gtgagcaggg ctcctgtcca ccagcacact cagttctctt ccctgcagtg 1380
ttttcatttt attttagcca aacattttgc ctgttttctg tttcaaacat gatagttgat 1440 
atgagactga aacccctggg ttgtggaggg aaattggctc agagatggac aacctggcaa 1500 
ctgtgagtcc ctgcttcccg acaccagcct catggaatat gcaacaactc ctgtacccca 1560 
gtccacggtg ttctggcagc agggacacct gggccaatgg gccatctgga ccaaaggtgg 1620 
ggtgtggggc cctggatggc agctctggcc cagacatgaa tacctcgtgt tcctcctccc 1680 
tctattactg tttcaccaga gctgtcttag ctcaaatctg ttgtgtttct gagtctaggg 1740 
tctgtacact tgtttataat aaatgcaatc gtttnggaaa aaaaananaa aaaaaaaagg 1800 
ggsggcgctc taaaaggatn ccccnaaggg gg 1832 

<210> 71 
<211> 51 
<212> PRT 

<213> Mus musculus 
<400> 71 

Ser Pro Arg Ser Lys Pro Thr Ala Gln Tyr Gln Trp Glu Arg Leu Ala
1 5 10 15

Pro Ser Ser Gln Val Phe Phe Gly Pro Ala Leu Asp Ala Val Arg Gly
20 25 30

Ser Leu Lys Leu Thr Asn Leu Ser Ile Ala Met Ser Gly Val Tyr Val
35 40 45 

Cys Lys Ala 
50 



<210> 72 
<211> 2557 
<212> DNA 

<213> Homo sapiens 
<400> 72 

gaattccggg agaagtgacc agagcaattt ctgcttttca cagggcgggt ttctcaacgg 60 
tgacttgtgg gcagtgcctt ctgctgagcg agtcatggcc cgaaggcaga actaactgtg 120
cctgcagtct tcactctcag gatgcagccg aggtgggccc aaggggccac gatgtggctt 180
ggagtcctgc tgacccttct gctctgttca agccttgagg gtcaagaaaa ctctttcaca 240
atcaacagtg ttgacatgaa gagcctgccg gactggacgg tgcaaaatgg gaagaacctg 300
accctgcagt gcttcgcgga tgtcagcacc acctctcacg tcaagcctca gcaccagatg 360 
ctgttctata aggatgacgt gctgttttac aacatctcct ccatgaagag cacagagagt 420 
tattttattc ctgaagtccg gatctatgac tcagggacat ataaatgtac tgtgattgtg 480
aacaacaaag agaaaaccac tgcagagtac cagctgttgg tggaaggagt gcccagtccc 540
agggtgacac tggacaagaa agaggccatc caaggtggga tcgtgagggt caactgttct 600 
gtcccagagg aaaaggcccc aatacacttc acaattgaaa aacttgaact aaatgaaaaa 660 
atggtcaagc tgaaaagaga gaagaattct cgagaccaga attttgtgat actggaattc 720
cccgttgagg aacaggaccg cgttttatcc ttccgatgtc aagctaggat catttctggg 780
atccatatgc agacctcaga atctaccaag agtgaactgg tcaccgtgac ggaatccttc 840
tctacaccca agttccacat cagccccacc ggaatgatca tggaaggagc tcagctccac 900 
attaagtgca ccattcaagt gactcacctg gcccaggagt ttccagaaat cataattcag 960 
aaggacaagg cgattgtggc ccacaacaga catggcaaca aggctgtgta ctcagtcatg 1020 
gccatggtgg agcacagtgg caactacacg tgcaaagtgg agtccagccg catatccaag 1080 
gtcagcagca tcgtggtcaa cataacagaa ctattttcca agcccgaact ggaatcttcc 1140 
ttcacacatc tggaccaagg tgaaagactg aacctgtcct gctccatccc aggagcacct 1200 
ccagccaact tcaccatcca gaaggaagat acgattgtgt cacagactca agatttcacc 1260 
aagatagcct caaagtcgga cagtgggacg tatatctgca ctgcaggtat tgacaaagtg 1320 
gtcaagaaaa gcaacacagt ccagatagtc gtatgtgaaa tgctctccca gcccaggatt 1380
tcttatgatg cccagtttga ggtcataaaa ggacagacca tcgaagtccg ttgcgaatcg 1440 
atcagtggaa ctttgcctat ttcttaccaa cttttaaaaa caagtaaagt tttggagaat 1500 
agtaccaaga actcaaatga tcctgcggta ttcaaagaca accccactga agacgtcgaa 1560 
taccagtgtg ttgcagataa ttgccattcc catgccaaaa tgttaagtga ggttctgagg 1620 
gtgaaggtga tagccccggt ggatgaggtc cagatttcta tcctgtcaag taaggtggtg 1680 
gagtctggag aggacattgt gctgcaatgt gctgtgaatg aaggatctgg tcccatcacc 1740 
tataagtttt acagagaaaa agagggcaaa cccttctatc aaatgacctc aaatgccacc 1800 
caggcatttt ggaccaagca gaaggctagc aaggaacagg agggagagta ttactgcaca 1860 
gccttcaaca gagccaacca cgcctccagt gtccccagaa gcaaaatact gacagtcaga 1920 
gtcattcttg ccccatggaa gaaaggactt attgcagtgg ttatcatcgg agtgatcatt 1980 
gctctcttga tcattgcggc caaatgttat tttctgagga aagccaaggc caagcagatg 2040 
ccagtggaaa tgtccaggcc agcagtacca cttctgaact ccaacaacga gaaaatgtca 2100 
gatcccaata tggaagctaa cagtcattac ggtcacaatg acgatgtcag aaaccatgca 2160 
atgaaaccaa taaatgataa taaagagcct ctgaactcag acgtgcagta cacggaagtt 2220 
caagtgtcct cagctgagtc tcacaaagat ctaggaaaga aggacacaga gacagtgtac 2280 
agtgaagtcc ggaaagctgt ccctgatgcc gtggaaagca gatactctag aacggaaggc 2340
tcccttgatg gaacttagac agcaaggcca gatgcacatc cctggaagga catccatgtt 2400 
ccgagaagaa cagataatcc ctgtatttca agacctctgt gcacttattt atgaacctgc 2460 
cctgctccca cagaacacag caattcctca ggctaagctg ccggttctta aatccatcct 2520 
gctaagttaa tgttgggtag aaagagatac agagggg 2557 

<210> 73 
<211> 738 
<212> PRT 

<213> Homo sapiens 
<400> 73 

Met Gin Pro Arg Trp Ala Gin Gly Ala Thr Met Trp Leu Gly Val Leu 
15 10 15 

Leu Thr Leu Leu Leu Cys Ser Ser Leu Glu Gly Gin Glu Asn Ser Phe 
20 25 30 

Thr lie Asn Ser Val Asp Met Lys Ser Leu Pro Asp Trp Thr Val Gin 
35 40 ~ 45 

Asn Gly Lys Asn Leu Thr Leu Gin Cys Phe Ala Asp Val Ser Thr Thr 
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50 



55 



60 



Ser His Val Lys Pro Gin His Gin Met Leu Phe Tyr Lys Asp Asp Val 
65 70 75 80 

Leu Phe Tyr Asn Ile Ser Ser Met Lys Ser Thr Glu Ser Tyr Phe Ile
85 90 95

Pro Glu Val Arg Ile Tyr Asp Ser Gly Thr Tyr Lys Cys Thr Val Ile
100 105 110

Val Asn Asn Lys Glu Lys Thr Thr Ala Glu Tyr Gln Leu Leu Val Glu
115 120 125

Gly Val Pro Ser Pro Arg Val Thr Leu Asp Lys Lys Glu Ala Ile Gln
130 135 140

Gly Gly Ile Val Arg Val Asn Cys Ser Val Pro Glu Glu Lys Ala Pro
145 150 155 160

Ile His Phe Thr Ile Glu Lys Leu Glu Leu Asn Glu Lys Met Val Lys
165 170 175

Leu Lys Arg Glu Lys Asn Ser Arg Asp Gln Asn Phe Val Ile Leu Glu
180 185 190

Phe Pro Val Glu Glu Gln Asp Arg Val Leu Ser Phe Arg Cys Gln Ala
195 200 205

Arg Ile Ile Ser Gly Ile His Met Gln Thr Ser Glu Ser Thr Lys Ser
210 215 220

Glu Leu Val Thr Val Thr Glu Ser Phe Ser Thr Pro Lys Phe His Ile
225 230 235 240

Ser Pro Thr Gly Met Ile Met Glu Gly Ala Gln Leu His Ile Lys Cys
245 250 255

Thr Ile Gln Val Thr His Leu Ala Gln Glu Phe Pro Glu Ile Ile Ile
260 265 270

Gln Lys Asp Lys Ala Ile Val Ala His Asn Arg His Gly Asn Lys Ala
275 280 285

Val Tyr Ser Val Met Ala Met Val Glu His Ser Gly Asn Tyr Thr Cys
290 295 300

Lys Val Glu Ser Ser Arg Ile Ser Lys Val Ser Ser Ile Val Val Asn
305 310 315 320

Ile Thr Glu Leu Phe Ser Lys Pro Glu Leu Glu Ser Ser Phe Thr His
325 330 335

Leu Asp Gln Gly Glu Arg Leu Asn Leu Ser Cys Ser Ile Pro Gly Ala
340 345 350

Pro Pro Ala Asn Phe Thr Ile Gln Lys Glu Asp Thr Ile Val Ser Gln
355 360 365

Thr Gln Asp Phe Thr Lys Ile Ala Ser Lys Ser Asp Ser Gly Thr Tyr
370 375 380

Ile Cys Thr Ala Gly Ile Asp Lys Val Val Lys Lys Ser Asn Thr Val
385 390 395 400

Gln Ile Val Val Cys Glu Met Leu Ser Gln Pro Arg Ile Ser Tyr Asp
405 410 415

Ala Gln Phe Glu Val Ile Lys Gly Gln Thr Ile Glu Val Arg Cys Glu
420 425 430

Ser Ile Ser Gly Thr Leu Pro Ile Ser Tyr Gln Leu Leu Lys Thr Ser
435 440 445

Lys Val Leu Glu Asn Ser Thr Lys Asn Ser Asn Asp Pro Ala Val Phe
450 455 460

Lys Asp Asn Pro Thr Glu Asp Val Glu Tyr Gln Cys Val Ala Asp Asn
465 470 475 480

Cys His Ser His Ala Lys Met Leu Ser Glu Val Leu Arg Val Lys Val
485 490 495

Ile Ala Pro Val Asp Glu Val Gln Ile Ser Ile Leu Ser Ser Lys Val
500 505 510

Val Glu Ser Gly Glu Asp Ile Val Leu Gln Cys Ala Val Asn Glu Gly
515 520 525

Ser Gly Pro Ile Thr Tyr Lys Phe Tyr Arg Glu Lys Glu Gly Lys Pro
530 535 540

Phe Tyr Gln Met Thr Ser Asn Ala Thr Gln Ala Phe Trp Thr Lys Gln
545 550 555 560

Lys Ala Ser Lys Glu Gln Glu Gly Glu Tyr Tyr Cys Thr Ala Phe Asn
565 570 575

Arg Ala Asn His Ala Ser Ser Val Pro Arg Ser Lys Ile Leu Thr Val
580 585 590

Arg Val Ile Leu Ala Pro Trp Lys Lys Gly Leu Ile Ala Val Val Ile
595 600 605

Ile Gly Val Ile Ile Ala Leu Leu Ile Ile Ala Ala Lys Cys Tyr Phe
610 615 620

Leu Arg Lys Ala Lys Ala Lys Gln Met Pro Val Glu Met Ser Arg Pro
625 630 635 640

Ala Val Pro Leu Leu Asn Ser Asn Asn Glu Lys Met Ser Asp Pro Asn
645 650 655

Met Glu Ala Asn Ser His Tyr Gly His Asn Asp Asp Val Arg Asn His
660 665 670

Ala Met Lys Pro Ile Asn Asp Asn Lys Glu Pro Leu Asn Ser Asp Val
675 680 685

Gln Tyr Thr Glu Val Gln Val Ser Ser Ala Glu Ser His Lys Asp Leu
690 695 700

Gly Lys Lys Asp Thr Glu Thr Val Tyr Ser Glu Val Arg Lys Ala Val 
705 710 715 720 

Pro Asp Ala Val Glu Ser Arg Tyr Ser Arg Thr Glu Gly Ser Leu Asp 
725 730 735 

Gly Thr 



<210> 74 
<211> 601 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> modified_base
<222> all "n" positions 
<223> n=a, c, g, or t 



<400> 74 

gnnnnnnagg tntanagncn cctttacncc gccgcggacg cgtgggcgga cgcgtggggt 60
gctgtggagc aagaagcaac ccgaagctag gagtctgtca gcgagggcag gggctgcctg 120
gttggggtag gagtgggagc agggccagca ggagggtctg aggaagccat tcaaagcgag 180
cagctgggag agctggggag ccgggaaggg cctacagact acaagagagg atcctggcgt 240
ctgggcctcc tgggtcatca ccatgaggcc acttcttgcc ctgctgcttc tgggtctggc 300
atcaggctct cctcctctgg acgacaacaa gatccccagc ctgtgtcccg ggcagcccgg 360
cctcccaggc acaccaggcc accacggcag ccaaggcctg cctggccgtg acggccgtga 420
tggccgcgac ggtgcacccg gagctccggg agagaaaggc gagggcggga gaccgggact 480
acctgggcca cgtngggagc ccgggccgcg tggagaggca ggacctgtgg gggctatcgg 540
gcctggnggg gaatgctcgg tgccccacga tcagcttcag tgccaagcga tcagaaagcc 600
c 601

<210> 75
<211> 732
<212> DNA

<213> Rattus norvegicus
<220>

<221> modified_base
<222> all "n" positions
<223> n=a, c, g, or t

<400> 75

gngngttnnn ttccncctcc gacttaaggc tgccatgggg cccagtgctc ctctgctcct 60
cttcttcctt ttgtcatggc cgggacccct tcagggacag cagcaccacc ttgtggagta 120
catggaacgc cgactagctg ccttagagga gcggctggca cagtgccagg atcagagcag 180
tcggcatgct gctgagcttc gggacttcaa aaacaagatg ctgcctctac tggaggtggc 240
agagaaggag cgggaaacac tcagaaccga ggcagacagc atttcaggaa gagtggaccg 300
tcttgaacgg gaagtagact acctggagac acagaaccca gctttgccct gtgtagaact 360
ggatgagaag gtgactggag gccctggaac caaaggcaag ggccggagaa atgagaaata 420
cgatatggtg acagactgta gctacacaat ctctcaggtg aggtcaatga agatcctgaa 480
gcggtttggt ggctcagctg gcctatggac caaggatcca ctggggccag canagaagat 540
ctacgtgtta gacggnacgc agaacgacac ggccttcgtt ttccganggt gcgtgactta 600
ccctcaccat ggctgccgca aagttccgaa tcgggtgccc ttncctgggt agnacaagaa 660
aactggtgtn tgtggcttcc tttttatctc aangcntctg gaggaacttg nanggggggn 720
nggtggnaaa at 732



<210> 76 

<211> 177 

<212> PRT 

<213> Homo sapiens 



<400> 76 



Gln Leu Gln Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu
1 5 10 15

Gly Glu Ser Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser
20 25 30

Ser Ser Gln Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln
35 40 45

Lys Glu Lys Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr
50 55 60

Ser Lys Pro Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu
65 70 75 80

Ser Leu Arg Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser
85 90 95

Cys Ser Val Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser
100 105 110

Ile Lys Thr Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser
115 120 125

Cys Arg Leu Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser
130 135 140

Cys Gln Ser Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg
145 150 155 160

Gln Leu Pro Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile
165 170 175

Arg



<210> 77 

<211> 735 

<212> DNA 

<213> Homo sapiens 

<400> 77 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720

ccaaaggtag ttccc 735 

<210> 78 
<211> 18
<212> PRT 

<213> Homo sapiens 
<400> 78 

Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr 

1 5 10 15 

Val Cys 



<210> 79 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 79 

Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val Thr Leu Glu

1 5 10 15

Val Ser Thr Gly Pro Gly 
20 

<210> 80 
<211> 728 
<212> DNA 

<213> Homo sapiens 

<400> 80 
atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60
gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120
catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180
gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240
ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300
ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360
cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420
gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480
gggccagcct gcagtttgat ctggtgaaga atggcgaatc ccttgcctct ttcttccagt 540
ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600
ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660
tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720
tctttgct 728

<210> 81
<211> 206
<212> PRT

<213> Homo sapiens
<220>

<221> SITE
<222> (13)
<223> Xaa=unknown amino acid

<400> 81

Met Ile Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu
1 5 10 15

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gln Leu Gln
20 25 30

Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu Gly Glu Ser
35 40 45
Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gln
50 55 60

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln Lys Glu Lys
65 70 75 80

Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr Ser Lys Pro
85 90 95

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg
100 105 110

Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val
115 120 125

Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser Ile Lys Thr
130 135 140

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu
145 150 155 160

Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gln Ser
165 170 175

Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg Gln Leu Pro
180 185 190

Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile Arg
195 200 205

<210> 82
<211> 217
<212> PRT

<213> Homo sapiens

<400> 82

Gln Leu Gln Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu
1 5 10 15

Gly Glu Ser Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser
20 25 30

Ser Ser Gln Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln
35 40 45

Lys Glu Lys Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr
50 55 60

Ser Lys Pro Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu
65 70 75 80

Ser Leu Arg Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser
85 90 95

Cys Ser Val Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser
100 105 110

Ile Lys Thr Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser
115 120 125

Cys Arg Leu Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser
130 135 140

Cys Gln Ser Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg
145 150 155 160

Gln Leu Pro Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile
165 170 175

Arg Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val
180 185 190

Tyr Val Cys Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val
195 200 205

Thr Leu Glu Val Ser Thr Gly Pro Gly
210 215

<210> 83
<211> 220
<212> PRT

<213> Homo sapiens

<400> 83



Gln Leu Gln Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu
1 5 10 15

Gly Glu Ser Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser
20 25 30

Ser Ser Gln Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln
35 40 45

Lys Glu Lys Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr
50 55 60

Ser Lys Pro Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu
65 70 75 80

Ser Leu Arg Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser
85 90 95

Cys Ser Val Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser
100 105 110

Ile Lys Thr Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser
115 120 125

Cys Arg Leu Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser
130 135 140

Cys Gln Ser Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg
145 150 155 160

Gln Leu Pro Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile
165 170 175

Arg Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val
180 185 190

Tyr Val Cys Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val
195 200 205

Thr Leu Glu Val Ser Thr Gly Pro Gly Ala Ala Val
210 215 220











<210> 84 
<211> 202 
<212> PRT 

<213> Homo sapiens 
<400> 84 



Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp Ser
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala
65 70 75 80

Asp Thr Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Gln Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190

Met Ala Ala Arg Lys Ala Ser Arg Val Arg
195 200

<210> 85 
<211> 67 
<212> PRT 

<213> Homo sapiens 



<400> 85 








Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser Gly Thr Leu
1 5 10 15

Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg Tyr Gly Ala
20 25 30

His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu Tyr Ala Trp
35 40 45

Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Arg Lys Lys Glu
50 55 60

Glu Glu Val
65



<210> 86 

<211> 19 

<212> PRT 

<213> Homo sapiens 



<400> 86 

Val Pro Phe Pro Trp Val Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe

1 5 10 15

Leu Tyr Phe 



<210> 87 
<211> 17 
<212> PRT 

<213> Homo sapiens 
<400> 87 

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn

1 5 10 15

Thr 



<210> 88 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<400> 88 

Ala Arg Arg Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn
1 5 10 15

Thr Leu Gln Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp
20 25 30

Ser Ser Val Phe Pro Ala Glu Gly Leu Ile Pro Pro Tyr Gly Leu Thr
35 40 45

Ala Asp Thr Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala
50 55 60

Val Tyr Ala Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu
65 70 75 80

Asp Pro Gln Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro
85 90 95

Arg Glu Asn



<210> 89 

<211> 320 

<212> PRT 

<213> Homo sapiens 



<400> 89

Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp Ser
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala
65 70 75 80

Asp Thr Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Gln Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val
195 200 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg
210 215 220

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gln
225 230 235 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
245 250 255

Phe Pro Ala Glu Gly Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
260 265 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
275 280 285

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
290 295 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305 310 315 320

<210> 90
<211> 385
<212> PRT 
<213> Homo sapiens 



<400> 90 



Gln Gln His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu
1 5 10 15

Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser Ser Arg His Ala Ala
20 25 30

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala
35 40 45

Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala Asp Thr Ile Ser Gly
50 55 60

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gln Asn
65 70 75 80

Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys Val Thr Gly Gly Pro
85 90 95

Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr
100 105 110

Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser Met Lys Ile Leu Lys
115 120 125

Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys Asp Pro Leu Gly Gln
130 135 140

Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln Asn Asp Thr Ala Phe
145 150 155 160

Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala Met Ala Ala Arg Lys
165 170 175

Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val Gly Thr Gly Gln Leu
180 185 190

Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg Pro Pro Gly Arg Pro
195 200 205

Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gln Leu Ile Lys Phe His
210 215 220

Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val Phe Pro Ala Glu Gly
225 230 235 240

Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr Tyr Ile Asp Leu Ala
245 250 255

Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala Thr Arg Glu Asp Asp
260 265 270

Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln Thr Leu Asp Thr Glu
275 280 285

Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn Ala Glu Ala Ala Phe
290 295 300

Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn Thr Arg Pro Ala Ser
305 310 315 320

Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser Gly Thr Leu Thr Pro
325 330 335

Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg Tyr Gly Ala His Ala
340 345 350

Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu Tyr Ala Trp Asp Asp
355 360 365

Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Arg Lys Lys Glu Glu Glu
370 375 380

Val
385

<210> 91
<211> 728
<212> DNA

<213> Homo sapiens 



<400> 91 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 

gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120

catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg tgcgcccggg 180 

gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240

ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300

ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360 

cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 

gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480 

gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540

ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 

ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 

tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 

tctttgct 728 

<210> 92 
<211> 69 
<212> PRT 

<213> Homo sapiens 



<400> 92 



Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser Gly
1 5 10 15

Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg Tyr
20 25 30

Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu Tyr
35 40 45

Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Arg Lys
50 55 60

Lys Glu Glu Glu Val
65



<210> 93 

<211> 202 

<212> PRT 

<213> Mus musculus 



<400> 93 



Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala
65 70 75 80

Asp Ser Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Pro Ala Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190

Met Ala Ala Arg Lys Ala Ser Arg Ile Arg
195 200





<210> 94 

<211> 69 

<212> PRT 

<213> Mus musculus 



<400> 94 

Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser Gly

1 5 10 15

Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg Tyr

20 25 30

Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu Tyr

35 40 45

Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Lys Lys

50 55 60 

Lys Glu Glu Glu Val 
65 

<210> 95 
<211> 19 
<212> PRT 

<213> Mus musculus 
<400> 95 

Val Pro Phe Pro Trp Val Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe

1 5 10 15

Leu Tyr Tyr 



<210> 96 

<211> 16 

<212> PRT 

<213> Mus musculus 

<400> 96 

Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn Thr
1 5 10 15 

<210> 97 
<211> 99 
<212> PRT 

<213> Mus musculus 
<400> 97 

Ala Arg Arg Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn
1 5 10 15

Thr Leu Gln Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp
20 25 30

Ser Ser Val Phe Pro Ala Glu Ser Leu Ile Pro Pro Tyr Gly Leu Thr
35 40 45

Ala Asp Thr Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala
50 55 60

Val Tyr Ala Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu
65 70 75 80

Asp Pro Gln Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro
85 90 95

Arg Glu Asn



<210> 98 

<211> 320 

<212> PRT 

<213> Mus musculus 



<400> 98

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala
65 70 75 80

Asp Ser Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Pro Ala Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190

Met Ala Ala Arg Lys Ala Ser Arg Ile Arg Val Pro Phe Pro Trp Val
195 200 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg
210 215 220

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gln
225 230 235 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
245 250 255

Phe Pro Ala Glu Ser Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
260 265 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
275 280 285

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
290 295 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305 310 315 320



<210> 99 
<211> 299
<212> PRT 
<213> Mus musculus 



<400> 99 



Gln Gln His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu
1 5 10 15

Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser Ser Arg His Ala Ala
20 25 30

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala
35 40 45

Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala Asp Ser Ile Ser Gly
50 55 60

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gln Asn
65 70 75 80

Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys Val Thr Gly Gly Pro
85 90 95

Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr
100 105 110

Asp Cys Ser Tyr Thr Val Ala Gln Val Arg Ser Met Lys Ile Leu Lys
115 120 125

Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys Asp Pro Leu Gly Pro
130 135 140

Ala Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln Asn Asp Thr Ala Phe
145 150 155 160

Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala Met Ala Ala Arg Lys
165 170 175

Ala Ser Arg Ile Arg Val Pro Phe Pro Trp Val Gly Thr Gly Gln Leu
180 185 190

Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg Pro Pro Gly Gly Pro
195 200 205

Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gln Leu Ile Lys Phe His
210 215 220

Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val Phe Pro Ala Glu Ser
225 230 235 240

Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr Tyr Ile Asp Leu Ala
245 250 255

Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala Thr Arg Asp Asp Asp
260 265 270

Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln Thr Leu Asp Thr Glu
275 280 285

Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
290 295













<210> 100 

<211> 728 

<212> DNA 

<213> Homo sapiens 



<400> 100 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 

gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120 

catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180 

gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240

ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgtcggggag tgctcggtgc 300 

ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360 

cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 

gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480 
gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540

ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600

ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660

tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720

tctttgct 728

<210> 101
<211> 728
<212> DNA

<213> Homo sapiens

<400> 101
atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60
gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120
catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180
gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240
ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300
ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360
cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420
gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480
gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540
ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600
ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660
tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720
tctttgct 728

<210> 102

<211> 243

<212> PRT

<213> Homo sapiens

<400> 102

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser
1 5 10 15

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly His Pro
20 25 30

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly
35 40 45

Arg Asp Gly Arg Asp Gly Arg Asp Gly Val Pro Gly Ala Pro Gly Glu
50 55 60

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro
65 70 75 80

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala Gly
85 90 95

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu
100 105 110

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val
115 120 125

Leu Val Asn Glu Gln Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr
130 135 140

Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr
145 150 155 160

Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Glu Ser Ile Ala
165 170 175

Ser Phe Phe Gln Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser
180 185 190

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln
195 200 205

Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp
210 215 220

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro
225 230 235 240

Val Phe Ala



<210> 103 
<211> 1338 
<212> DNA 

<213> Homo sapiens 
<400> 103 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60 

agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 120 

gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 

cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240 

actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300 

gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggtgc 360 

gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420 

gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 480

tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540 

tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 600 

gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 

gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 720 

ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780 

ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 840 

gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 

tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 

gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020 

gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 

gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140 

ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 

ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 

gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320 

aaaaaaaagg gcggccgc 1338 

<210> 104 
<211> 243 
<212> PRT 

<213> Homo sapiens 



<400> 104 



Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser
1 5 10 15

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly His Pro
20 25 30

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly
35 40 45

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu
50 55 60

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro
65 70 75 80

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Val Gly
85 90 95

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu
100 105 110

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val
115 120 125

Leu Val Asn Glu Gln Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr
130 135 140

Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr
145 150 155 160

Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Glu Ser Ile Ala
165 170 175

Ser Phe Phe Gln Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser
180 185 190

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln
195 200 205

Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp
210 215 220

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro
225 230 235 240

Val Phe Ala



<210> 105 

<211> 1338 

<212> DNA 

<213> Homo sapiens 

<400> 105 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60 

agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 120 

gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 

cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240 

actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300 

gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 360 

gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420 

gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgt cggggagtgc 480

tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540 

tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 600 

gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 

gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 720 

ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780 

ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 840

gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 

tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 

gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020

gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 

gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140 

ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 

ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 

gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320 

aaaaaaaagg gcggccgc 1338 

<210> 106 
<211> 243 
<212> PRT 
<213> Homo sapiens 

<400> 106 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser
1 5 10 15

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly His Pro
20 25 30

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly
35 40 45

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu
50 55 60

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro
65 70 75 80

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala Gly
85 90 95

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu
100 105 110

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val
115 120 125

Leu Ala Asn Glu Gln Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr
130 135 140

Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr
145 150 155 160

Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Glu Ser Ile Ala
165 170 175

Ser Phe Phe Gln Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser
180 185 190

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln
195 200 205

Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp
210 215 220

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro
225 230 235 240

Val Phe Ala



<210> 107 
<211> 1338 
<212> DNA 

<213> Homo sapiens 

<400> 107 
gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60
agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 120
gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180
cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240
actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300
gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 360
gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420
gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 480
tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540
tctgacgcac ccttgccctt cgaccgcgtg ctggcgaacg agcagggaca ttacgacgcc 600
gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660
gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 720
ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780
ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 840
gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900
tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960
gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020
gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080
gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140
ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200
ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260
gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320
aaaaaaaagg gcggccgc 1338
<210> 108

<211> 243 

<212> PRT 

<213> Homo sapiens 



<400> 108 



Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser
1 5 10 15

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly His Pro
20 25 30

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly
35 40 45

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu
50 55 60

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro
65 70 75 80

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala Gly
85 90 95

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu
100 105 110

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val
115 120 125

Leu Val Asn Glu Gln Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr
130 135 140

Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr
145 150 155 160


Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Glu Ser Leu Ala
165 170 175

Ser Phe Phe Gln Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser
180 185 190

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln
195 200 205

Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp
210 215 220

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro
225 230 235 240

Val Phe Ala



<210> 109 

<211> 1338 

<212> DNA 

<213> Homo sapiens 



<400> 109 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60

agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 120

gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180

cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240

actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300

gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 360

gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420

gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 480

tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540

tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 600 

gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 

gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccct tgcctctttc 720 

ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780

ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 840

gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 

tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 

gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020

gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 

gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140

ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 

ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 

gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320

aaaaaaaagg gcggccgc 1338

<210> 110 
<211> 406 
<212> PRT 

<213> Homo sapiens 



<400> 110 



Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp Ser
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala
65 70 75 80

Asp Thr Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Ala Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Gln Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val
195 200 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg
210 215 220

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gln
225 230 235 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
245 250 255

Phe Pro Ala Glu Gly Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
260 265 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
275 280 285

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
290 295 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305 310 315 320

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn
325 330 335

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser
340 345 350

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg
355 360 365

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu
370 375 380

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Arg
385 390 395 400

Lys Lys Glu Glu Glu Val
405











<210> 111

<211> 1831 

<212> DNA 

<213> Homo sapiens 

<400> 111 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 

tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 300 

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 

cgggaggcag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 

aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 480

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 840

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 112 
<211> 406 
<212> PRT 
<213> Homo sapiens 

<400> 112 

Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp Ser
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala
65 70 75 80

Asp Thr Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Ile Val Thr Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Gln Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val
195 200 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg
210 215 220

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gln
225 230 235 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
245 250 255

Phe Pro Ala Glu Gly Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
260 265 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
275 280 285

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
290 295 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305 310 315 320

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn
325 330 335

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser
340 345 350

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg
355 360 365

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu
370 375 380

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Arg
385 390 395 400

Lys Lys Glu Glu Glu Val
405



<210> 113 
<211> 1831 
<212> DNA 

<213> Homo sapiens 
<400> 113 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 
tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 300

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 

cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 

aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatata 480

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 840 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020 

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 114 
<211> 406 
<212> PRT 

<213> Homo sapiens 
<400> 114 

Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp Ser
1 5 10 15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
20 25 30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
35 40 45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
50 55 60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala
65 70 75 80

Asp Thr Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
85 90 95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys
100 105 110

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys
115 120 125

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser
130 135 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Pro Ala Gly Ile Trp Thr Lys
145 150 155 160

Asp Pro Leu Gly Gln Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
165 170 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
180 185 190

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val
195 200 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg
210 215 220

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gln
225 230 235 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
245 250 255

Phe Pro Ala Glu Gly Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
260 265 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
275 280 285

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
290 295 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305 310 315 320

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn
325 330 335

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser
340 345 350

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg
355 360 365

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu
370 375 380

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Arg
385 390 395 400

Lys Lys Glu Glu Glu Val
405





















<210> 115 
<211> 1831 
<212> DNA 

<213> Homo sapiens 
<400> 115 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 

tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 300 

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 

cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 

aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 480 

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtatatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 840 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020 

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 116 
<211> 406 
<212> PRT 

<213> Homo sapiens 
<400> 116 



Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp Ser 
1 5 10 15 

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg 
20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser 
35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 
50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 
85 90 95 

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 
100 105 110 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 
115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr Ile Ser Gln Val Arg Ser 
130 135 140 

Met Lys Ile Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gln Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln 
165 170 175 

Asn Asp Thr Val Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 
180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 
195 200 205 

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 
210 215 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gln 
225 230 235 240 

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 
245 250 255 

Phe Pro Ala Glu Gly Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr 
260 265 270 

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 
275 280 285 

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln 
290 295 300 

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn 
325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser 
340 345 350 

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg 
355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu 
370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Arg 
385 390 395 400 

Lys Lys Glu Glu Glu Val 
405 

<210> 117 

<211> 1831 

<212> DNA 

<213> Homo sapiens 

<400> 117 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 

tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 300 

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 

cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 

aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 480 

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagtcttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 840 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020 

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 118 
<211> 242 
<212> PRT 

<213> Mus musculus 
<400> 118 

Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly Ser 
1 5 10 15 

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly Gln Pro 
20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 
50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Gly Pro Met Gly Ala Ile Gly Pro Ala Gly Glu 
85 90 95 

Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu Ser 
100 105 110 

Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg Val Leu 
115 120 125 

Leu Asn Glu Gln Gly His Tyr Asp Pro Thr Thr Gly Lys Phe Thr Cys 
130 135 140 

Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr Arg 
145 150 155 160 

Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Gln Ser Ile Ala Ser 
165 170 175 

Phe Phe Gln Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser Gly 
180 185 190 

Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln Val 
195 200 205 

Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp Ser 
210 215 220 

Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro Val 
225 230 235 240 

Phe Ala 



<210> 119 

<211> 1263 

<212> DNA 

<213> Mus musculus 



<400> 119 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg tagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag agccgggtac 480 

ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 

cgc 1263 

<210> 120 

<211> 243 

<212> PRT 

<213> Mus musculus 



<400> 120 



Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly Ser 
1 5 10 15 

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly Gln Pro 
20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 
50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala Ile Gly Pro Ala Gly 
85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Val Lys Arg Ser Glu 
100 105 110 

Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg Val 
115 120 125 

Leu Leu Asn Glu Gln Gly His Tyr Asp Pro Thr Thr Gly Lys Phe Thr 
130 135 140 

Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Gln Ser Ile Ala 
165 170 175 

Ser Phe Phe Gln Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 
180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln 
195 200 205 

Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp 
210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 

<210> 121 
<211> 1263 
<212> DNA 

<213> Mus musculus 
<400> 121 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagtct tcagtgccaa gcgatccgag agccgggtac 480 

ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 

cgc 1263 

<210> 122 
<211> 243 
<212> PRT 

<213> Mus musculus 



<400> 122 



Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly Ser 
1 5 10 15 

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly Gln Pro 
20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 
50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala Ile Gly Pro Ala Gly 
85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu 
100 105 110 

Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg Ala 
115 120 125 

Leu Leu Asn Glu Gln Gly His Tyr Asp Pro Thr Thr Gly Lys Phe Thr 
130 135 140 

Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arg Ala Ser Leu Gln Phe Asp Leu Val Lys Asn Gly Gln Ser Ile Ala 
165 170 175 

Ser Phe Phe Gln Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 
180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln 
195 200 205 

Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp 
210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 

<210> 123 

<211> 1263 

<212> DNA 

<213> Mus musculus 

<400> 123 
gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag agccgggtac 480 

ctccgccagc cgacacaccc ctacctttcg accgtgcgct gctaaatgag cagggccatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 

cgc 1263 

<210> 124 
<211> 243 
<212> PRT 

<213> Mus musculus 
<400> 124 

Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly Ser 
1 5 10 15 

Pro Pro Leu Asp Asp Asn Lys Ile Pro Ser Leu Cys Pro Gly Gln Pro 
20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gln Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 
50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala Ile Gly Pro Ala Gly 
85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu 
100 105 110 

Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg Val 
115 120 125 

Leu Leu Asn Glu Gln Gly His Tyr Asp Pro Thr Thr Gly Lys Phe Thr 
130 135 140 

Cys Gln Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arg Ala Ser Leu Gln Phe Asp Ile Val Lys Asn Gly Gln Ser Ile Ala 
165 170 175 

Ser Phe Phe Gln Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 
180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gln Val Trp Val Gln 
195 200 205 

Val Gly Val Gly Asp Tyr Ile Gly Ile Tyr Ala Ser Ile Lys Thr Asp 
210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 125 

<211> 1263 

<212> DNA 

<213> Mus musculus 

<400> 125 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag agccgggtac 480 

ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agcttgcagt ttgatattgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 

cgc 1263 

<210> 126 
<211> 406 
<212> PRT 
<213> Mus musculus 

<400> 126 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 
1 5 10 15 

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg 
20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser 
35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 
50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser Ile Ser Gly Arg Val Asp Arg Ile Glu Arg Glu Val Asp Tyr 
85 90 95 

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 
100 105 110 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 
115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gln Val Arg Ser 
130 135 140 

Met Lys Ile Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Pro Ala Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln 
165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 
180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Ile Arg Val Pro Phe Pro Trp Val 
195 200 205 

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 
210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gln 
225 230 235 240 

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 
245 250 255 

Phe Pro Ala Glu Ser Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr 
260 265 270 

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 
275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln 
290 295 300 

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn 
325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser 
340 345 350 

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg 
355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu 
370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Lys 
385 390 395 400 

Lys Lys Glu Glu Glu Val 
405 



<210> 127 
<211> 1721 
<212> DNA 

<213> Mus musculus 



<400> 127 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 

ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 

gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 240 

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtatt 300 

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360 

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 

atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 480 

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 

gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 

cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720 

gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 

gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 

aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 128 
<211> 406 
<212> PRT 

<213> Mus musculus 



<400> 128 



Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 
1 5 10 15 

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg 
20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser 
35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 
50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 
85 90 95 

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 
100 105 110 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 
115 120 125 

Tyr Asp Ile Val Thr Asp Cys Ser Tyr Thr Val Ala Gln Val Arg Ser 
130 135 140 

Met Lys Ile Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Pro Ala Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln 
165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 
180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Ile Arg Val Pro Phe Pro Trp Val 
195 200 205 

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 
210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gln 
225 230 235 240 

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 
245 250 255 

Phe Pro Ala Glu Ser Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr 
260 265 270 

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 
275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln 
290 295 300 

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 






Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn
                325                 330                 335

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser
            340                 345                 350

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg
        355                 360                 365

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu
    370                 375                 380

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Lys
385                 390                 395                 400

Lys Lys Glu Glu Glu Val
                405



<210> 129 

<211> 1721 

<212> DNA 

<213> Mus musculus 



<400> 129 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 

ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 

gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 240 

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 300 

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420

atagtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 480

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540

gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600

cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720

gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 

gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 

aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 130 
<211> 406 
<212> PRT 

<213> Mus musculus 



<400> 130 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr
1               5                   10                  15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
            20                  25                  30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
        35                  40                  45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
    50                  55                  60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala
65                  70                  75                  80

Asp Ser Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
                85                  90                  95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys
            100                 105                 110

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys
        115                 120                 125

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gln Val Arg Ser
    130                 135                 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys
145                 150                 155                 160

Asp Pro Leu Gly Pro Ala Glu Lys Ile Tyr Ala Leu Asp Gly Thr Gln
                165                 170                 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala
            180                 185                 190

Met Ala Ala Arg Lys Ala Ser Arg Ile Arg Val Pro Phe Pro Trp Val
        195                 200                 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg
    210                 215                 220

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gln
225                 230                 235                 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
                245                 250                 255

Phe Pro Ala Glu Ser Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
            260                 265                 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
        275                 280                 285

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
    290                 295                 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305                 310                 315                 320

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn
                325                 330                 335

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser
            340                 345                 350

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg
        355                 360                 365

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu
    370                 375                 380

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Lys
385                 390                 395                 400

Lys Lys Glu Glu Glu Val
                405





















<210> 131 
<211> 1721 
<212> DNA 

<213> Mus musculus 
<400> 131 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 
ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 
gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 240

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 300

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420

atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 480

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 

gcgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 

cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720 

gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 

gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 

aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 132 
<211> 406 
<212> PRT 

<213> Mus musculus 



<400> 132 
Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr
1               5                   10                  15

Gly Pro Leu Gln Gly Gln Gln His His Leu Val Glu Tyr Met Glu Arg
            20                  25                  30

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gln Cys Gln Asp Gln Ser
        35                  40                  45

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro
    50                  55                  60

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala
65                  70                  75                  80

Asp Ser Ile Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr
                85                  90                  95

Leu Glu Thr Gln Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys
            100                 105                 110

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys
        115                 120                 125

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gln Val Arg Ser
    130                 135                 140

Met Lys Ile Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys
145                 150                 155                 160

Asp Pro Leu Gly Pro Ala Glu Lys Ile Tyr Val Leu Asp Gly Thr Gln
                165                 170                 175

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Val
            180                 185                 190

Met Ala Ala Arg Lys Ala Ser Arg Ile Arg Val Pro Phe Pro Trp Val
        195                 200                 205

Gly Thr Gly Gln Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg
    210                 215                 220

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gln
225                 230                 235                 240

Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val
                245                 250                 255

Phe Pro Ala Glu Ser Leu Ile Pro Pro Tyr Gly Leu Thr Ala Asp Thr
            260                 265                 270

Tyr Ile Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala
        275                 280                 285

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gln
    290                 295                 300

Thr Leu Asp Thr Glu Gln Gln Trp Asp Thr Pro Cys Pro Arg Glu Asn
305                 310                 315                 320

Ala Glu Ala Ala Phe Val Ile Cys Gly Thr Leu Tyr Val Val Tyr Asn
                325                 330                 335

Thr Arg Pro Ala Ser Arg Ala Arg Ile Gln Cys Ser Phe Asp Ala Ser
            340                 345                 350

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg
        355                 360                 365

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gln Leu
    370                 375                 380

Tyr Ala Trp Asp Asp Gly Tyr Gln Ile Val Tyr Lys Leu Glu Met Lys
385                 390                 395                 400

Lys Lys Glu Glu Glu Val
                405



<210> 133 

<211> 1721 

<212> DNA 

<213> Mus musculus 

<400> 133 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 

ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 

gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 240 

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 300

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420

atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 480 

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 

gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 

cttgtcatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720

gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 

gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380
aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 134 
<211> 370 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 



<400> 134 



Met Ile Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu
1               5                   10                  15

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gln Leu Gln
            20                  25                  30

Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu Gly Glu Ser
        35                  40                  45

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Ala Ser Ser Ser Gln
    50                  55                  60

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln Lys Glu Lys
65                  70                  75                  80

Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr Ser Lys Pro
                85                  90                  95

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg
            100                 105                 110

Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val
        115                 120                 125

Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser Ile Lys Thr
    130                 135                 140

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu
145                 150                 155                 160

Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gln Ser
                165                 170                 175

Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg Gln Leu Pro
            180                 185                 190

Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile Arg Gly Ser
        195                 200                 205

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys
    210                 215                 220

Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val Thr Leu Glu
225                 230                 235                 240

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly
                245                 250                 255

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His
            260                 265                 270

Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp Ile Lys Glu Asp
        275                 280                 285

Ala Ile Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr Ile
    290                 295                 300

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg
305                 310                 315                 320

Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser
                325                 330                 335

Leu Ser Ser Gln Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp
            340                 345                 350

Gly Pro Pro Ser Thr Asn Ile Pro His Pro Trp Trp Gly Phe Phe Leu
        355                 360                 365

Trp Leu
    370



<210> 135 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modified_base

<222> all "n" positions 

<223> n=a, c, g, or t 



<400> 135 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 

gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120

cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 

cttcagcatg gtacaccttg cacagggagg cgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420 

gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 480 

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540 

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840 

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

ggcggccgc 1869 



<210> 136 

<211> 370 

<212> PRT 

<213> Homo sapiens 
<220>

<221> SITE

<222> (13) 

<223> Xaa=unknown amino acid 



<400> 136 
Met Ile Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu
1               5                   10                  15

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gln Leu Gln
            20                  25                  30

Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu Gly Glu Ser
        35                  40                  45

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gln
    50                  55                  60

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln Lys Glu Lys
65                  70                  75                  80

Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr Ser Lys Pro
                85                  90                  95

Gly Val Ser Leu Ala Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg
            100                 105                 110

Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val
        115                 120                 125

Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser Ile Lys Thr
    130                 135                 140

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu
145                 150                 155                 160

Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gln Ser
                165                 170                 175

Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg Gln Leu Pro
            180                 185                 190

Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile Arg Gly Ser
        195                 200                 205

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys
    210                 215                 220

Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val Thr Leu Glu
225                 230                 235                 240

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly
                245                 250                 255

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His
            260                 265                 270

Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp Ile Lys Glu Asp
        275                 280                 285

Ala Ile Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr Ile
    290                 295                 300

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg
305                 310                 315                 320

Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser
                325                 330                 335

Leu Ser Ser Gln Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp
            340                 345                 350

Gly Pro Pro Ser Thr Asn Ile Pro His Pro Trp Trp Gly Phe Phe Leu
        355                 360                 365

Trp Leu
    370



<210> 137 
<211> 1869 
<212> DNA

<213> Homo sapiens 
<220> 

<221> modified_base
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 137 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 

gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 

cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 

cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420 

gggtcacaac aagcaaacct ggagtatcct tggcctactc catgccctcc cggaacctgt 480 

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840 

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020 

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

ggcggccgc 1869 

<210> 138 
<211> 370 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 



<400> 138 
Met Ile Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu
1               5                   10                  15

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gln Leu Gln
            20                  25                  30

Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu Gly Glu Ser
        35                  40                  45

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gln
    50                  55                  60

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln Lys Glu Lys
65                  70                  75                  80

Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr Ser Lys Pro
                85                  90                  95

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg
            100                 105                 110

Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val
        115                 120                 125

Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser Ile Lys Thr
    130                 135                 140

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Ile
145                 150                 155                 160

Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gln Ser
                165                 170                 175

Pro Arg Ser Lys Pro Ala Val Gln Tyr Gln Trp Asp Arg Gln Leu Pro
            180                 185                 190

Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile Arg Gly Ser
        195                 200                 205

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys
    210                 215                 220

Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val Thr Leu Glu
225                 230                 235                 240

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly
                245                 250                 255

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His
            260                 265                 270

Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp Ile Lys Glu Asp
        275                 280                 285

Ala Ile Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr Ile
    290                 295                 300

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg
305                 310                 315                 320

Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser
                325                 330                 335

Leu Ser Ser Gln Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp
            340                 345                 350

Gly Pro Pro Ser Thr Asn Ile Pro His Pro Trp Trp Gly Phe Phe Leu
        355                 360                 365

Trp Leu
    370



<210> 139 
<211> 1869 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> modified_base
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 139 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 
gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 
cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 

cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420

gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 480

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540 

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgta tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

ggcggccgc 1869

<210> 140 
<211> 370 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 



<400> 140 



Met Ile Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu
1               5                   10                  15

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gln Leu Gln
            20                  25                  30

Leu His Leu Pro Ala Asn Arg Leu Gln Ala Val Glu Glu Gly Glu Ser
        35                  40                  45

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gln
    50                  55                  60

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gln Lys Glu Lys
65                  70                  75                  80

Glu Asp Gln Val Leu Ser Tyr Ile Asn Gly Val Thr Thr Ser Lys Pro
                85                  90                  95

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg
            100                 105                 110

Val Glu Gly Leu Gln Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val
        115                 120                 125

Asn Val Gln Asp Lys Gln Gly Lys Ser Arg Gly His Ser Ile Lys Thr
    130                 135                 140

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu
145                 150                 155                 160

Gln Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gln Ser
                165                 170                 175

Pro Arg Ser Lys Pro Val Val Gln Tyr Gln Trp Asp Arg Gln Leu Pro
            180                 185                 190

Ser Phe Gln Thr Phe Phe Ala Pro Ala Leu Asp Val Ile Arg Gly Ser
        195                 200                 205

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys
    210                 215                 220

Lys Ala His Asn Glu Val Gly Thr Ala Gln Cys Asn Val Thr Leu Glu
225                 230                 235                 240

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly
                245                 250                 255

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His
            260                 265                 270

Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp Ile Lys Glu Asp
        275                 280                 285

Ala Ile Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr Ile
    290                 295                 300

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg
305                 310                 315                 320

Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser
                325                 330                 335

Leu Ser Ser Gln Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp
            340                 345                 350

Gly Pro Pro Ser Thr Asn Ile Pro His Pro Trp Trp Gly Phe Phe Leu
        355                 360                 365

Trp Leu
    370



<210> 141
<211> 1869
<212> DNA
<213> Homo sapiens

<220>
<221> modified_base
<222> all "n" positions
<223> n=a, c, g, or t

<400> 141

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60

gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120

cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300

cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420

gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 480

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600

tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660

tgaccctgag ctgccagtct ccaaggagta agcccgttgt ccaataccag tgggatcggc 720

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020 

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

ggcggccgc 1869

<210> 143

<211> 1846 

<212> DNA 

<213> Mus musculus 



<400> 143 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300 

ccacccccgg gaggtgccca tcatgatctg gttcttggaa caagaaggga aggaaccaaa 360

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 
caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 

cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740 

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 1846 

<210> 144 
<211> 394 
<212> PRT

<213> Mus musculus
<213> Mus musculus
<400> 145 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300

ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540 

caaaagcata gagctcaaag cgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 

caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 

cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 1846

<210> 147
<211> 1846 
<212> DNA 

<213> Mus musculus 

<400> 147 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300

ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660

tgctcagtac cagtgggaga ggctggtccc atcctcccag gtcttctttg gaccagcctt 720

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080

caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200

cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320 

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620 

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740 

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 1846 

<210> 148 
<211> 394 
<212> PRT 

<213> Mus musculus 



<400> 148 



Met Ile Leu Gln Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu
1               5                   10                  15

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gln Met Glu
            20                  25                  30

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu
        35                  40                  45

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp
    50                  55                  60

Ser His Pro Arg Glu Val Pro Ile Leu Ile Trp Phe Leu Glu Gln Glu
65                  70                  75                  80

Gly Lys Glu Pro Asn Gln Val Leu Ser Tyr Ile Asn Gly Val Met Thr
                85                  90                  95

Asn Lys Pro Gly Thr Ala Leu Val His Ser Ile Ser Ser Arg Asn Val
            100                 105                 110

Ser Leu Arg Leu Gly Ala Leu Gln Glu Gly Asp Ser Gly Thr Tyr Arg
        115                 120                 125

Cys Ser Val Asn Val Gln Asn Asp Glu Gly Lys Ser Ile Gly His Ser
    130                 135                 140

Ile Lys Ser Ile Glu Leu Lys Val Leu Val Pro Pro Ala Pro Pro Ser
145                 150                 155                 160

Cys Ser Leu Gln Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn
                165                 170                 175

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gln Tyr Gln Trp Glu Arg
            180                 185                 190

Leu Ala Pro Ser Ser Gln Val Phe Phe Gly Pro Ala Leu Asp Ala Val
        195                 200                 205

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser Ile Ala Met Ser Gly Val
    210                 215                 220

Tyr Val Cys Lys Ala Gln Asn Arg Val Gly Phe Ala Lys Cys Asn Val
225                 230                 235                 240

Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val Ala Gly Ala
                245                 250                 255

Val Val Gly Thr Phe Val Gly Leu Val Leu Ile Ala Gly Leu Val Leu
            260                 265                 270

Leu Tyr Gln Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp Ile
        275                 280                 285

Lys Glu Asp Ala Ile Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser
    290                 295                 300

Asp Thr Ile Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg
305                 310                 315                 320

Ala Leu Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly Thr Phe Thr
                325                 330                 335

Pro Thr Pro Ser Val Ser Ser Gln Ala Leu Ser Ser Pro Arg Leu Pro
            340                 345                 350

Arg Val Asp Glu Pro Pro Pro Gln Ala Val Ser Leu Thr Pro Gly Gly
        355                 360                 365

Val Ser Ser Ser Val Leu Ser Arg Met Gly Ala Val Pro Val Met Val
    370                 375                 380

Pro Ala Gln Ser Gln Ala Gly Ser Leu Val
385                 390















<210> 149 

<211> 1846 

<212> DNA 

<213> Mus musculus 



<400> 149 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300

ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660

tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080

caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200

cccaggtggg gtttcttctt ctgttctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 1846

<210> 150 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 150 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
1               5                   10                  15

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro
            20                  25                  30

Ser Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn
        35                  40                  45

Ser Ile Pro Glu Ser Cys Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser
    50                  55                  60

Gln Tyr Cys Cys Ser Asp Val Leu Lys Lys Ile Gln Trp Asn Glu Glu
65                  70                  75                  80

Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Ala His Pro Glu Thr
                85                  90                  95

Pro Glu Gln Leu Gly Ser Val Leu Lys Tyr Gln Ser Ser Leu Asp Ser
            100                 105                 110

Asp Asn Met Pro Gly Phe Gly Ala Thr Val Ala Ile Gly Leu Thr Val
        115                 120                 125

Phe Val Val Phe Ile Ala Thr Ile Ile Val Cys Phe Thr Cys Ser Cys
    130                 135                 140

Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn
145                 150                 155                 160

Thr Thr Thr Thr Thr Val Val His Thr Ala Tyr Pro Gln Pro Gln Pro
                165                 170                 175

Val Ala Pro Ser Tyr Pro Gly Pro Thr Tyr Gln Gly Tyr His Pro Met
            180                 185                 190

Pro Pro Gln Pro Gly Met Pro Ala Ala Pro Tyr Pro Thr Gln Tyr Pro
        195                 200                 205

Pro Pro Tyr Leu Ala Gln Pro Thr Gly Pro Pro Ala Tyr His Glu Thr
    210                 215                 220

Leu Ala Gly Ala Ser Gln Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro
225                 230                 235                 240

Pro Lys Ala Val Pro
                245



<210> 151 
<211> 1801 
<212> DNA 

<213> Homo sapiens 
<400> 151 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 

cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240 

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360 

gctgggttca gtgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540

caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 

tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 1140

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200

cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260

tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320 

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620

gggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

a 1801 

<210> 152 
<211> 245 
<212> PRT 
<213> Homo sapiens 



<400> 152 



Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro
1               5                   10                  15

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro
            20                  25                  30

Ser Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn
        35                  40                  45

Ser Ile Pro Glu Ser Cys Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser
    50                  55                  60

Gln Tyr Cys Cys Ser Asp Val Leu Lys Lys Ile Gln Trp Asn Glu Glu
65                  70                  75                  80

Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Ala His Pro Glu Thr
                85                  90                  95

Pro Glu Gln Leu Gly Ser Ala Leu Lys Tyr Gln Ser Ser Leu Asp Ser
            100                 105                 110

Asp Asn Met Pro Gly Phe Gly Ala Thr Val Ala Ile Gly Leu Thr Val
        115                 120                 125

Phe Val Val Phe Ile Ala Thr Ile Ile Val Cys Phe Thr Cys Ser Cys
    130                 135                 140

Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn
145                 150                 155                 160

Thr Thr Thr Thr Thr Ala Val His Thr Ala Tyr Pro Gln Pro Gln Pro
                165                 170                 175

Val Ala Pro Ser Tyr Pro Gly Pro Thr Tyr Gln Gly Tyr His Pro Met
            180                 185                 190

Pro Pro Gln Pro Gly Met Pro Ala Ala Pro Tyr Pro Thr Gln Tyr Pro
        195                 200                 205

Pro Pro Tyr Leu Ala Gln Pro Thr Gly Pro Pro Ala Tyr His Glu Thr
    210                 215                 220

Leu Ala Gly Ala Ser Gln Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro
225                 230                 235                 240

Pro Lys Ala Val Pro
                245

<210> 153 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<400> 153 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 
cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240 

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360 

gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540 

caacaccaca actactaccg cggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 

tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 1140

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 

cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 

tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320 

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620

gggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680 

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

a 1801 

<210> 154 
<211> 245 
<212> PRT 

<213> Homo sapiens 



<400> 154 
Cys
245




170 175 
Tyr Gln Gly Tyr His Pro Met
190 

Pro Tyr Pro Thr Gln Tyr Pro
205 

Pro Pro Ala Tyr His Glu Thr 
220 

Asn Pro Ala Tyr Met Asp Pro 
235 240 



<210> 155 

<211> 1801 

<212> DNA 

<213> Homo sapiens 



<400> 155 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 

cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240 

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360 

gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420 

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540 

caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagtaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 

tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840 

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 1140

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 

cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 

tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320 

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620 

gggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680 

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

a 1801 

<210> 156 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 156 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
1               5                   10                  15
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245 



<210> 157 
<211> 1801 
<212> DNA 

<213> Homo sapiens 
<400> 157 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 

cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360

gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540

caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 

tcccccaaag gtagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840 

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 1140 

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 

cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 
tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620 

gggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680 

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

a 1801 



<210> 158 

<211> 213 

<212> PRT 

<213> Mus musculus 



<400> 158 
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210 



<210> 159 

<211> 1858 

<212> DNA

<213> Mus musculus 



<400> 159 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300 

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 
cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420

tcgccattgg cgtgaccatg tttgtggtgt ttattgccac tatcatcatc tgcttcacct 480 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 720 

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 1140

gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320 

tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380 

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

<210> 160 
<211> 213 
<212> PRT 

<213> Mus musculus 



<400> 160 
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He 
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He 
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Ala Ala Thr Leu Pro 
210 

<210> 161 
<211> 1858 
<212> DNA 

<213> Mus musculus 
<400> 161 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120 

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300 

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 

cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420 

tcgccattgg cgtgaccatc tttgtggtgt ttattgtcac tatcatcatc tgcttcacct 480 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 720 

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 1140 

gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320 

tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380 

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

<210> 162 
<211> 213 
<212> PRT 

<213> Mus musculus 



<400> 162 



Met Ala Ala Pro Ala Pro Ser Leu Trp Thr Leu Leu Leu Leu Leu Leu
1               5                   10                  15

Leu Leu Pro Pro Pro Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe
            20                  25                  30

Gly Glu Asp Asn Ser Ile Pro Val Phe Cys Pro Asp Phe Cys Cys Gly
        35                  40                  45

Ser Cys Ser Asn Gln Tyr Cys Cys Ser Asp Val Leu Arg Lys Ile Gln
    50                  55                  60

Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Thr
65                  70                  75                  80

Pro Ala Glu Glu Thr Pro Glu His Leu Gly Ser Ala Leu Lys Phe Arg
                85                  90                  95

Ser Ser Phe Asp Ser Asp Pro Met Ser Gly Phe Gly Ala Thr Val Ala
            100                 105                 110

Ile Gly Val Thr Ile Phe Val Val Phe Ile Ala Thr Ile Ile Ile Cys
        115                 120                 125

Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gln Arg
    130                 135                 140

Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Ala His Ala Pro Tyr
145                 150                 155                 160

Pro Gln Pro Gln Pro Gln Pro Val Ala Pro Ser Tyr Pro Gly Pro Thr
                165                 170                 175

Tyr Gln Gly Tyr His Pro Met Pro Pro Pro Ala Arg Asn Ala Ser Ser
            180                 185                 190

Thr Leu Pro Asn Ala Val Pro Thr Thr Leu Pro Gly Pro Ala His Arg
        195                 200                 205

Ala Ala Thr Leu Pro
    210

<210> 163 

<211> 1858 

<212> DNA 

<213> Mus musculus 

<400> 163 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120 

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300 

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 

cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420 

tcgccattgg cgtgaccatc tttgtggtgt ttattgccac tatcatcatc tgcttcacct 480 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggctcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 720 

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 1140 

gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320 

tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380 

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

<210> 164
<211> 213
<212> PRT

<213> Mus musculus 



<400> 164 



Met Ala Ala Pro Ala Pro Ser Leu Trp Thr Leu Leu Leu Leu Leu Leu
1               5                   10                  15

Leu Leu Pro Pro Pro Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe
            20                  25                  30

Gly Glu Asp Asn Ser Ile Pro Val Phe Cys Pro Asp Phe Cys Cys Gly
        35                  40                  45

Ser Cys Ser Asn Gln Tyr Cys Cys Ser Asp Val Leu Arg Lys Ile Gln
    50                  55                  60

Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Thr
65                  70                  75                  80

Pro Ala Glu Glu Thr Pro Glu His Leu Gly Ser Ala Leu Lys Phe Arg
                85                  90                  95

Ser Ser Phe Asp Ser Asp Pro Met Ser Gly Phe Gly Ala Thr Val Ala
            100                 105                 110

Ile Gly Val Thr Ile Phe Val Val Phe Ile Ala Thr Ile Ile Ile Cys
        115                 120                 125

Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gln Arg
    130                 135                 140

Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Val His Ala Pro Tyr
145                 150                 155                 160

Pro Gln Pro Gln Pro Gln Pro Val Ala Pro Ser Tyr Pro Gly Pro Thr
                165                 170                 175

Tyr Gln Gly Tyr His Pro Met Pro Pro Pro Ala Arg Asn Ala Ser Ser
            180                 185                 190

Thr Leu Pro Asn Val Val Pro Thr Thr Leu Pro Gly Pro Ala His Arg
        195                 200                 205

Ala Ala Thr Leu Pro
    210



<210> 165 

<211> 1858 

<212> DNA 

<213> Mus musculus 

<400> 165 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120 

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 

cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420 

tcgccattgg cgtgaccatc tttgtggtgt ttattgccac tatcatcatc tgcttcacct 480 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgta gtacccacca ccctacctgg cccagcccac agggccgcca 720

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 1140 
gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320 

tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

<210> 166 
<211> 639 
<212> DNA 
<213> Mus musculus 

<400> 166 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 

aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat gtttgtggtg 360

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 480

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 639 

<210> 167 
<211> 639 
<212> DNA 
<213> Mus musculus 

<400> 167 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 

aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 

tttattgtca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 480 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 639

<210> 168 
<211> 639 
<212> DNA 

<213> Mus musculus 
<400> 168 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 
aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggctca tgccccttac 480 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 639 

<210> 169 
<211> 639 
<212> DNA 

<213> Mus musculus 



<400> 169 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 

aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 480

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgt agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 639

<210> 170 
<211> 1218 
<212> DNA 

<213> Homo sapiens 



<400> 170 

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 120

ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 

aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 240

gacaccatct ccgggagagt ggatcgtctg gagcgggagg cagactatct ggagacccag 300 

aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 360 

ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 420 

caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 480 

gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagcc 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 

tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720

ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 780 

gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840 

gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 

ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960 

gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 1020

agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080

gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 

gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 1200 

aagaaagagg aggaggtt 1218 



<210> 171 
<211> 1218 
<212> DNA 
<213> Homo sapiens
<400> 171 

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 120 

ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 

aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 240

gacaccatct ccgggagagt ggatcgtctg gagcgggagg tagactatct ggagacccag 300 

aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 360 

ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 420 

caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 480

gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagcc 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 

tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720 

ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 780

gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840

gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 

ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960 

gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 1020 

agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080 

gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 

gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 1200

aagaaagagg aggaggtt 1218 

<210> 172 
<211> 1219 
<212> DNA 

<213> Homo sapiens 
<400> 172 

catggggccc agcacccctc tcctcatctt gttccttttg tcatggtcgg gacccctcca 60 

aggacagcag caccaccttg tggagtacat ggaacgccga ctagctgctt tagaggaacg 120

gctggcccag tgccaggacc agagtagtcg gcatgctgct gagctgcggg acttcaagaa 180 

caagatgctg ccactgctgg aggtggcaga gaaggagcgg gaggcactca gaactgaggc 240

cgacaccatc tccgggagag tggatcgtct ggagcgggag gtagactatc tggagaccca 300 

gaacccagct ctgccctgtg tagagtttga tgagaaggtg actggaggcc ctgggaccaa 360 

aggcaaggga agaaggaatg agaagtacga tatggtgaca gactgtggct acacaatctc 420

tcaagtgaga tcaatgaaga ttctgaagcg atttggtggc ccagctggta tatggaccaa 480

ggatccactg gggcaaacag agaagatcta cgtgttagat gggacacaga atgacacagc 540

ctttgtcttc ccaaggctgc gtgacttcac ccttgccatg gctgcccgga aagcttcccg 600 

agtccgggtg cccttcccct gggtaggcac agggcagctg gtatatggtg gctttcttta 660 

ttttgctcgg aggcctcctg gaagacctgg tggaggtggt gagatggaga acactttgca 720 

gctaatcaaa ttccacctgg caaaccgaac agtggtggac agctcagtat tcccagcaga 780

ggggctgatc cccccctacg gcttgacagc agacacctac atcgacctgg cagctgatga 840

ggaaggtctt tgggctgtct atgccacccg ggaggatgac aggcacttgt gtctggccaa 900 

gttagatcca cagacactgg acacagagca gcagtgggac acaccatgtc ccagagagaa 960 

tgctgaggct gcctttgtca tctgtgggac cctctatgtc gtctataaca cccgtcctgc 1020 

cagtcgggcc cgcatccagt gctcctttga tgccagcggc accctgaccc ctgaacgggc 1080 

agcactccct tattttcccc gcagatatgg tgcccatgcc agcctccgct ataacccccg 1140 

agaacgccag ctctatgcct gggatgatgg ctaccagatt gtctataagc tggagatgag 1200 

gaagaaagag gaggaggtt 1219 

<210> 173 
<211> 1218 
<212> DNA 

<213> Homo sapiens 
<400> 173

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 120 

ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 

aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 240 

gacaccatct ccgggagagt ggatcgtctg gagcgggagg tagactatct ggagacccag 300 

aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 360 

ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 420 

caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 480

gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagtc 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 

tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720 

ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 780 

gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840 

gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 

ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960 

gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 1020 

agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080 

gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 

gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 1200 

aagaaagagg aggaggtt 1218 

<210> 174 
<211> 729 
<212> DNA 

<213> Mus musculus 



<400> 174 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240

gggccgcgtg gagaggtagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480 

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 540 

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 

<210> 175 
<211> 729 
<212> DNA 

<213> Mus musculus 



<400> 175 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240 

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagtcttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480 

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 540 
tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 

<210> 176 
<211> 729 
<212> DNA 

<213> Mus musculus 
<400> 176 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgcgctgcta aatgagcagg gccattacga ccccactact 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480 

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 540 

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 

<210> 177 
<211> 729 
<212> DNA 

<213> Mus musculus 
<400> 177 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480 

cgggccagct tgcagtttga tattgtcaaa aacgggcagt ccatcgcctc tttcttccag 540 

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 

<210> 178 
<211> 1218 
<212> DNA 

<213> Mus musculus 
<400> 178 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240 

gactccatct caggaagagt ggaccgtatt gaaagggagg tagactatct ggagacacag 300

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360 

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 420 
caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 480

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 
aagaaggagg aggaagtt 1218 

<210> 179 

<211> 1218 

<212> DNA 

<213> Mus musculus 

<400> 179 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240 

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360

ggcaagggcc gaagaaatga gaaatacgat atagtgacgg actgtagcta cacagtcgct 420 

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 480

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 
aagaaggagg aggaagtt 1218 

<210> 180 
<211> 1218 
<212> DNA 
<213> Mus musculus 

<400> 180 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 420

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 480

gatccgctgg ggccagcaga gaagatctac gcgttagacg gcacccagaa cgacacggct 540

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 

aagaaggagg aggaagtt 1218

<210> 181 
<211> 1218 
<212> DNA 

<213> Mus musculus 
<400> 181 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 420

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 480

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 540

tttgtcttcc caaggctgcg tgacttcacc cttgtcatgg ctgcccggaa agcttcccga 600

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200

aagaaggagg aggaagtt 1218

<210> 182
<211> 1110
<212> DNA
<213> Homo sapiens

<220>
<221> modified_base
<222> all "n" positions
<223> n=a, c, g, or t

<400> 182

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggcg 180

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 480

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080

catccctggt ggggtttttt cctttggctt 1110



<210> 183 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modified_base
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 183 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120 

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gcctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 480 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540 

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840 

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 

<210> 184 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modified_base
<222> all "n" positions 
<223> n=a, c, g, or t 



<400> 184 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtatc 480 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 

<210> 185 
<211> 1110 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> modified_base
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 185 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120 

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 480 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540 

cccgttgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840 

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 

<210> 186 
<211> 1182 
<212> DNA 

<213> Mus musculus 
<400> 186 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 
gaggagtcgt ggtcccaccc ccgggaggtg cccatcatga tctggttctt ggaacaagaa 240

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 

ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 1140 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 187 
<211> 1182 
<212> DNA 

<213> Mus musculus 



<400> 187 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 240

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420

ataggccaca gcatcaaaag catagagctc aaagcgctgg ttcctccagc tcctccatcc 480 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 1140 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 188 
<211> 1182 
<212> DNA 

<213> Mus musculus 



<400> 188 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 240

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 
ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540

aggagtaaac ctactgctca gtaccagtgg gagaggctgg tcccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 1140

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 189 

<211> 1182 

<212> DNA 

<213> Mus musculus 

<400> 189 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 240 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 

ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgttc tgagccgcat gggtgctgtg 1140 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 190 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 190 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 

ggttcagtgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 
tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720

ccaaaggcag ttccc 735 

<210> 191 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 191 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgcggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720

ccaaaggcag ttccc 735 

<210> 192 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 192 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gtaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720

ccaaaggcag ttccc 735 

<210> 193 
<211> 22 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 193 

caaagtgagc tcatgctctc ac 22 

<210> 194 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> reverse primer
<400> 194

ctctggtctt gggcagaaat c 21



<210> 195 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 195 

ggagtatcct tggtctactc c 21 

<210> 196 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 196 

gaaagtctgg aaggatggaa gct 23

<210> 197
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> forward primer
<400> 197

ggatgatggc taccagattg tc 22

<210> 198
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> reverse primer
<400> 198

ggaacattga gggttttgac tc 22
Box I Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet)

This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. [ ] Claims Nos.:
because they relate to subject matter not required to be searched by this Authority, namely:

2. [ ] Claims Nos.:
because they relate to parts of the international application that do not comply with the prescribed requirements to such
an extent that no meaningful international search can be carried out, specifically:

3. [ ] Claims Nos.:
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).



Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet)

This International Searching Authority found multiple inventions in this international application, as follows:
Please See Extra Sheet.

1. [ ] As all required additional search fees were timely paid by the applicant, this international search report covers all searchable
claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this international search report covers
only those claims for which fees were paid, specifically claims Nos.:

4. [X] No required additional search fees were timely paid by the applicant. Consequently, this international search report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:
1-10, 12 and 18

Remark on Protest
[ ] The additional search fees were accompanied by the applicant's protest.
[ ] No protest accompanied the payment of additional search fees.
B. FIELDS SEARCHED

Electronic data bases consulted (Name of data base and where practicable terms used): 

Commercial Sequence Databases: GenEmbl, N_Geneseq_36, Issued_Patents_NA, EST, a-geneseq36, swiss-prot38,
strembl12, pir64, a-issued
Sequences searched: SEQ ID NOS: 1-3 and 8-10 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search 
fees must be paid. 

Group I, claim(s) 1-10, 12 and 18, in so far as they are drawn to human and mouse Tango 253, polynucleotides of SEQ
ID NOS: 1, 2, 8 and 9, vector, host cell, method of producing a protein and polypeptides of SEQ ID NOS: 3-5 and 10-12.

Groups II-IV, claim(s) 1-10, 12 and 18, in so far as they are drawn to the polynucleotides of distinct cDNA clones and 
encoded proteins of human and mouse Intercept 258, Tango 281 and Tango 257, listed in Tables 1-4, pages 59-63. 
Groups V-VIII, claim(s) 11 and 15, in so far as they are drawn to antibodies and binding compounds to the polypeptides
listed in groups I-IV, respectively.
Groups IX-XII, claim(s) 13, 14, 19, 20 and 22, in so far as they are drawn to a method for detecting the presence of a
polypeptide or a method for identifying a compound which binds to or modulates the activity of a polypeptide listed in
groups I-IV, respectively.

Groups XIII-XVI, claim(s) 16 and 17, in so far as they are drawn to a method for detecting the presence of a nucleic 
acid molecule listed in groups I-IV, respectively. 

Groups XVII-XX, claim 21, in so far as it is drawn to a method for modulating the activity of a polypeptide listed in 
groups I-IV, respectively. 



The inventions listed as Groups I-XX do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: Group I 
corresponds to the first invention wherein the first product is the polynucleotide and the first method of using is the 
method of making the protein. Note that there is no method of making the polynucleotide. The invention also includes 
the protein made. Each of groups II-IV does not share the same or corresponding special technical feature because each 
group is drawn to a different polynucleotide and encoded protein, and each of groups V-XX does not share the same or 
corresponding special technical feature because each group is drawn to different compounds or methods of using the 
four polynucleotides and encoded proteins. This Authority therefore considers that the several inventions do not share a 
special technical feature within the meaning of PCT Rule 13.2 and thus do not relate to a single general inventive 
concept within the meaning of PCT Rule 13.1. 



Form PCT/ISA/210 (extra sheet) (July 1998)* 



